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PESTICIDAL AGENTS 

The present invention relates to materials, agents and 
5 compositions having pesticidal activity which derive from 
bacteria , and more particularly from Xenorhabdus species* 
The invention further relates to organisms and methods 
employing such compounds and compositions. 

10 There is an ongoing requirement for materials, agents, 
compositions and organisms having pesticidal activity, 
for instance for use in crop protection or insect- 
mediated disease control. Novel materials are required 
to overcome the problem of resistence to existing 

15 pesticides. Ideally such materials are cheap to produce, 
stable, have a high toxicity (either when used alone or 
in combination) and are effective when taken orally by 
the pest target. Thus any invention which provided 
materials, agents, compositions or organisms in which any 

20 of these properties was enhanced would represent a step 
forward in the art. 

Xenorhabdus spp. in nature are frequently symbiotically 
associated with a nematode host, and it is known that 
25 this association may be used to control pest activity. 
For instance, it is known that certain Xenorhabdus spp. 
alone are capable of killing an insect host when injected 
into the host's hemocoel. 

30 in addition, one extracellular insecticidal toxin from 
Photorhabdus luminescens has been isolated (this species 
was recently removed from the genus Xenorhabdus , and is 
closely related to the species therein). This toxin" is 
not effective when ingested, but is highly toxic when 

35 injected into certain insect larvae (see Parasites and 
Pathogens of Insects Vol.2, Eds. Beckage, N. E. et 
al., Academic Press 1993). 
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Also known are certain low-molecular weight heterocyclic 
compounds from P . luminescens and X.nematophilus which 
have antibiotic properties when applied intravenously or 
topically (see Rhodes, S.H. et al., PCT WO 84/01775). 

Unfortunately none of these prior art materials have the 
ideal pesticide characteristics discussed above, and in 
particular, they do not have toxic activity when 
administered orally. 



The present invention provides pesticidal agents and 
compositions from Xenorhabdus species, organisms which 
produce such compounds and compositions, and methods 
which employ these agents, compositions and organisms, 
15 that alleviate some of the problems with the prior art . 

According to one aspect of the present invention there is 
disclosed a method of killing or controlling insect pests 
comprising administering cells from Xenorhabdus species 
20 or pesticidal materials derived or obtainable therefrom, 
orally to the pests. 

A PCT application of CSIRO published as WO 95/00647 
discloses an apparently toxic protein from Xenorhabdus 
25 nematophllus; however no details of the protein's 

toxicity are given, and certainly there is no disclosure 
of its use as an oral insecticide. 



30 



35 



Thus the invention provides an insecticidal composition 
adapted for oral administration to an insect, which 
composition comprises a pesticidal material obtainable 
from a Xenorhabdus species, or a pesticidal fragment 
thereof, or a pesticidal variant or derivative of either 
of these. 

The composition may in fact comprise cells of Xenorhabdus 
or alternatively supernatant taken from cultures of cells 
of Xenorhabdus species. However, the composition 



SUBSTITUTE SHEET (RULE 26) 



BN8DOCID: <WO_JB80838SA1JL> 



WO 98/08388 




PCT/GB97/02284 



preferably comprises toxins isolable from Xenorhabdus as 
illustrated hereinafter. Toxic activity has been 
associated with material encoded by the nucleotide 
sequence of Figure 2. Thus, the composition suitably 
5 comprises a pesticidal material which is encoded by all 
or part of the nucleotide sequence of Figure 2. 
Pesticidal fragments as well as variants or derivatives 
of such toxins may also be employed. 

10 The sequence of Figure 2 is of the order of 40kb in 

length. It is believed that this sequence may encode more 
than one protein, each of which may regulate or be 
insecticidal either alone or when presented together. It 
is a matter of routine to determine which parts are 

15 necessary or sufficient for insecticidal activity. 

As used herein the term "variant* * refers to toxins which,, 
have modified amino acid sequence but which share similar 
activity. Certain amino acids may be replaced with 

20 different amino acids without altering the nature of the 
activity in a significant way. The replacement may be 
by way of "conservative substitution ' ' where an amino 
acid is replaced with an amino acid of broadly similar 
properties, or there may be some non-conservative :x 

25 substitutions. In general however, the variants will be 
at least 60% homologous to the native toxin, suitably at 
least 70% homologous and more preferably at least 90% 
homologous . 

30 The term "derivative"' relates to toxins which have been 
modified for example by chemical or biological methods . 

These toxins are novel, and they and the nucleic acids 
which encode them form a further aspect of the invention. 

35 

A preferred Xenorhabdus speci s is the bacteria 
X.nematophilus . Particular strains of X.nema tophi las 
which are useful in the context of the invention are 
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ATTC 19061 strain, available from the National Collection 
of Industrial and Marine Bacteria , Aberdeen , Scotland 
( NCIMB ) . In addition, suitable strains include two novel 
strains of Xenorhabdus which were deposited at the NCIMB 
5 on 10 July 1997 and were designated with repository 
numbers NCIMB 40886 and NCIMB 40887. These latter 
strains form a further aspect of the invention. 



All strains have 


common characteristics as 


set out in the 


following Table 1 
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15 *NBTA (Oxoid nutrient agar containing 0.0025% 

bromothymol blue and 0.004% tetrazolium chloride) 

Preferably the pest target is an insect, and more 
preferably it is of the order Lepidoptera , particularly 
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Pieris brasslcae , Pieris rapae , or Plutella xylostella or 
the order Dlptera , particularly Cul&x quinquefaciatus. 

In a preferred embodiment of the invention, cells from 
5 Xenorhabdus species or agents derived therefrom are used 
in conjunction with Bacillus thuringiensis as an oral 
pesticide . 

In further embodiments, rather than using Bacillus 
to thuringiensis itself, pesticidal materials obtainable 
from B . thuringiensis (e.g. delta endotoxins or other 
isolates) are used in conjunction with Xenorhabdus 
species . 

15 The term 'obtainable from' is intended to embrace not 

only materials which have been isolated directly from the 
bacterium in question, but also those which have been 
subsequently cloned into and produced by other organisms. 

20 Thus the unexpected discovery that bacteria of the genus 
Xenorhabdus ( and materials derived therefrom) have 
pesticidal activity when ingested, and that such bacteria 
and materials can be used advantageously in conjunction 
with B . thuringiensis (and toxins or materials derived 

25 therefrom), forms the basis of a further aspect of the 
present invention. The pesticidal activity of 
B . thuringiensis isolates alone have been well documented. 
However, synergistic pesticidal activity between such 
isolates and bacteria of the Xenorhabdus species (or 

30 materials derived therefrom) has not previously been 
demonstrated . 

In still further embodiments of the invention, culture 
supernatant taken from cultures of Xenorhabdus species, 
35 particularly X. nematophilus , is used in place of cells 
from Xenorhabdus species in the methods above. 
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6 

All of these methods can be employed, inter in pest 

control . 

The invention also makes available pesticidal 
compositions comprising cells from Xenorhabdus species, 
preferably X.nematophilus , in combination with B . 
thuringiensis . As with the methods above, a pesticidal 
toxin from B . thuringiensis (preferably a delta endotoxin) 
may be used as an alternative to B ♦ thuringiensis in the 
compositions of the present invention 

Likewise, culture supernatant taken from cultures of 
Xenorhabdus species, preferably, X . nematophilus may be 
used in place of cells from Xenorhabdus species. 

Such compositions can be employed, inter alia, for crop 
protection eg. by spraying crops, or for livestock 
protection. In addition, compositions of the invention 
may be used in vector control. 

The invention further encompasses novel pesticidal agents 
which can be isolated from Xenorhabdus spp. Techniques 
for isolating such agents would be understood by the 
skilled person. 

In particular, such techniques include the separation and 
identification of toxin proteins either at the protein 
level or at the DNA level. 

The applicants have cloned and partially sequenced a 
region of DNA from Xenorhabdus NCIMB 40887 which region 
codes for insecticidal activity and this is shown as 
Figure 2 ( SEQ ID NO. 1) hereinafter. Thus in a preferred 
embodiment the invention also provides a toxin which is 
encoded by DNA of SEQ ID No. 1 or a variant or fragment 
thereof . 
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The invention also provides a recombinant DNA which 
encodes such a toxin. The recombinant DNA of the 
invention may comprise the sequence of Figure 2 or a 
variant or fragment thereof. Other DNA sequences may 
5 encode similar proteins as a result of the degeneracy of 
the genetic code. All such sequences are encompassed by 
the invention. 

The sequence provided herein is sufficient to allow 
10 probes to be produced which can be used to identify and 
subsequently to extract DNA of toxin genes. This DNA may 
then be cloned into vectors and host cells as is 
understood in the art. 

15 DNA which comprises or hybridises with the sequence of 
Figure 2 under stringent conditions forms a further 
aspect of the invention. 

The expression "hybridises with*' means that the 
20 nucleotide sequence will anneal to all or part of the 

sequence of Figure 2 under stringent hybridisation ^ 
conditions, for example those illustrated in "Molecular 
Cloning", A Laboratory Manual ' 1 by Sambrook, Fritsch and 
Maniatis, Cold Spring Habor Laboratory Press, Cold Spring 
25 Harbor, N.Y. 

The length of the sequence used in any particular 
analytical technique will depend upon the nature of the 
technique, the degree of complementarity of the sequence, 

30 the nature of the sequence and particularly the GC 
content of the probe or primer and the particular 
hybridisation conditions employed. Under high 
stringency, only sequences which are completely 
complementary will bind but under low stringency 

35 conditions, sequences which are 60% homologous to the 

target sequence, more suitably 80% homologous, will bind. 
Both high and low stringency conditions are encompassed 
by the term " v string nt conditions" used herein. 
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Suitable fragments of the DNA of Figure 2, i.e. those 
which encode pesticidal agents may be identified using 
standard techniques. For example, transposon 
mutagenesis techniques may be used, for example as 
described by H.S. Siefert et al., Proc. Natl. Acad. Sci. 
USA, (1986) 83, 735-739. Vectors such as the cosmid 
cHRIMI , can be mutated using a variety of transposons and 
then screened for loss of insectidal activity. in this 
way regions of DNA encoding proteins responsible for 
toxic activity can be identified. 

For example, the mini-transposon mTnJ3(HIS3) can be 
introduced into a toxic Xenorhabdus clone such as cHRIMI, 
hereinafter referred to as " clone 1', by electroporating 
cHRIMI DNA into E.coli RDP 1 4 6 ( pLB 101) and mating this 
strain with E.coli RDP146 (pOX38 ) , followed by E. coli 
NS2114Sm. The final strain will contain cHRIMlDNA with a 
single insertion of the transposon mTn3(HIS3), These 
colonies can be cultured and tested for insecticidal 
activity as described in Example 8 hereinafter. 
Restriction mapping or DNA sequencing can be used to 
identify the insertion point of mTn3(HIS3) and hence the 
regions of DNA involved in toxicity. Similar approached 
can be used with other transposons such as Tn5 and mTn5. 

Site directed mutagenesis of cHRIMI as outlined in 
^Molecular Cloning, A Laboratory Manual" by Maniatis, 
Fritsch and Sambrook, (1982) Cold Spring Harbor, can also 
be used to test the importance of specific regions of DNA 
for toxic activity. 

Alternatively, subcloning techniques can be used to 
identify regions of the cloned DNA which code for 
insecticidal activity. In this method, specific smaller 
fragments of the DNA are 6ubcloned and the activity 
determined. To do this, cosmid DNA can be cut with a 
suitable restriction enzyme and ligated into a compatible 
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restriction site on a plasmid vector , such as pUC19. 
The ligation mix can be transformed into E . coli and 
transformed clones selected using a selection marker such 
as antibiotic resistance, which is coded for on the 
plasmid vector. Details of these techniques are 
described for example in Maniatis et al, supra, (see 
p390-391) and Methods in Molecular Biology, by L.G. 
Davies, M.D. Dibner and J.F. Battey, Elsevier, (see p222- 
224) . 

Individual colonies containing specific cloned fragments 
can be cultured and tested for activity as described in 
Example 8 hereinafter. Subclones with insecticidal 
activity can be further truncated using the same 
methodology to further identify regions of the DNA coding 
for activity. 

The invention also discloses an isolated pesticidal agent 
characterised in that the agent is obtainable from 
cultures of X. nematophilus or variants thereof, has oral 
pesticidal activity against Pieris brassicae , Pierls 
rapae and Plutella xylostella , is substantially heat 
stable to 55°C, is proteinaceous , acts synergistically 
with S. thurlngiensis cells as an oral pesticide and is 
substantially resistant to proteolysis by trypsin and 
proteinase K. 

By 'substantially heat stable to 55°C is meant that the 
agent retains some pesticidal activity when tested after 
heating the agent in suspension to 55°C for 10 minutes, 
and preferably retains at least 50% of the untreated 
activity. 

By 'substantially resistant to proteolysis' is meant that 
the agent retains some pesticidal activity when exposed 
to proteas s at 30°C for 2 hours and preferably retains 
at least 50% of the untreated activity. 
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By 'acts synergistically ' is meant that the activity of 
the combination of components is greater than one might 
expect from the use of the components individually. For 
example, when used in conjunction with B . thuringiensis 
cells as an oral pesticide, the concentration of B. 
thuringiensis cellular material necessary to give 50% 
mortality in a P.brassicae when used alone is reduced by 
at least 80% when it is used in combination the agent at 
a concentration sufficient to give 25% mortality when the 
agent is used alone ♦ 

It has been found that the activity of the material is 
retained by 30 kDa cut-off filters but is only partly 
retained by 100 kDa filters. 

Preferably the agent is still further characterised in 
that the pesticidal activity is lost through treatment at 
25°C with sodium dodecyl sulphate (SDS - 0.1% 60 mins) 
and acetone (50%, 60 mins). 

Clearly the characterising properties of the isolated 
agent described above can be utilised to purify it from, 
or enrich its concentration in, Xenorhabdus species cells 
and culture medium supernatants . Methods of purifying 
proteins from heterogenous mixtures are well known in the 
art (eg. ammonium sulphate precipitation, proteolysis, 
ultrafiltration with known molecular weight cut-off 
filters , ion -exchange chromatography, gel filtration , 
etc.). The oral pesticidal activity provides a 
convenient method of assaying the level of agent after 
each stage, or in each sample of eluent. Such 
methodology does not require inventive endeavour by those 
skilled in the art. 

The invention further discloses oral pesticidal 
compositions comprising one or more agents as described 
above. Such compositions preferably further comprise 
other pesticidal materials from non-Xenorhabdus species . 
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These other materials may be chosen such as to have 
complementary properties to the agents described above, 
or act synergistically with it. 

5 Preferably the oral pesticidal composition comprises one 
or more pesticidal agents as described above in 
combination with B . thuringiensis (or with a toxin 
derived therefrom, preferably endotoxin } . 

10 Recombinant DNA encoding said proteins also forms a 
further aspect of the invention. The DNA may be 
incorporated into an expression vector under the 
influence of suitable control elements such as promoters, 
enhancers, signal sequences etc. as is understood in the 

15 art. These expression vectors form a further aspect of 
the invention. They may be used to transform a host 
organism so as to ensure that the organism produces the r; 
toxin . r 

20 The invention further makes available a host organism 

comprising a nucleotide sequence coding for a pesticial,. 
agent as described above. _ 

Methods of cloning the sequence for a characterised 
25 protein into a host organism are well known in the art. 
For instance the protein may be purified and sequenced: 
as activity is not required for sequencing, SDS gel 
electrophoresis followed by blotting of the gel may be 
used to purify the protein. The protein sequence can be 
30 used to generate a nucleotide probe which can itself be 
used to identify suitable genomic fragments from a 
Xenorhabdus gene library. These fragments can then be 
inserted via a suitable vector into a host organism which 
can express the protein. The use of such general 
35 methodology is routine and non-inventive to those skilled 
in the art. Such techniques may be applied to the 
production of Xenorhabdus toxins other than those encoded 
by the sequence of Figure 2 . 
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It may be desirable to manipulate (eg. mutate) the agent 
by altering its gene sequence (and hence protein 
structure) such as to optimise its physical or 
toxicoiogical properties . 

It may also be desirable for the host to be engineered or 
selected such that it also expresses other proteinaceous 
pesticidal materials (eg. delta- endotoxin from B. 
thuringlensis) . Equally it may be desirable to generate 
host organisms which express fusion proteins composed of 
the active portion of the agent plus these other toxicity 
enhancing materials. 

A host may be selected for the purposes of generating 
large quantities of pesticidal materials for purification 
e.g. by using B . thuringlensis transformed with the agent- 
coding gene. Preferably however the host is a plant, 
which would thereby gain improved pest-resistance. 
Suitable plant vectors, eg. the Ti plasmid from 
Agrobacterium tumefaciens, are well known in the art. 
Alternatively the host may be selected such as to be 
directly pathogenic to pests, eg. an insect baculovirus . 

The teaching and scope of the present invention embraces 
all of these host organisms plus the agents, mutated 
agents or agent-fusion materials which they express. 

Thus the invention makes available methods, compositions, 
agents and organisms having industrially applicable 
pesticidal activity, being particularly suited to 
improved crop protection or insect-mediated disease 
control . 

The methods, compositions and agents of the present 
invention will now be described, by way of illustration 
only, through reference to the following non-limiting 
examples and figures. Other embodiments falling within 
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the scope of the invention will occur to those skilled in 
the art in the light of these. 



FIGURE 

5 Figure 1 shows the variation with time of the growth of 
X. nematophilus ATCC 19061 and activity of cells and 
supernatants against P. brassicae as described in Example 
3. 

10 Figure 2 6hows the seguence of a major part of a cloned 
toxin gene from Xenorhabdus . 

Figure 3 shows a comparison of the restriction maps of 
cloned toxin genes from two strains of Xenorhabdus 
15 (clone 1 above and clone 3 below). 



EXAMPLES 

■ & 

20 

Example 1 - Use of X. nematophilus cells as an oral 

■m 

insecticide 

CELL GROWTH: A subculture of X . nematophilus (ATCC 19061, 
25 Strain 9965 available from the National Collections of 
Industrial and Marine Bacteria, Aberdeen, Scotland) was 
used to inoculate 250 ml Erlenmeyer flasks each 
containing 50 ml of Luria Broth containing lOg tryptone, 
5g yeast extract and 5g NaCl per litre. Cultures were 
30 grown in the flasks at 27 °C for 40hrs on a rotary shaker. 

PRODUCTION OF CELL SUSPENSION: Cultures were centrifuged 
at 5000 x g for 10 mins . The supernatants were discarded 
and the cell pellets washed once and resuspended in an 
35 equal volume of phosphate buffered saline (8g NaCl, 1.44g 
Na 2 HP0 4 and 0.24g of KH 2 P0 4 per litre) at pH 7.4. 
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ACTIVITY OF CELL SUSPENSION TO INSECTS: The bioassays 
were as follows: p. brassicae: The larvae were allowed 
to feed on an artificial agar-based diet (as described by 
David and Gardiner (1965) London Nature, 207, 882-883) 
into which a series of dilutions of cell suspension had 
been incorporated. The bioassays were performed using a 
series of 5 doses with a minimum of 25 larvae per dose. 
Untreated and heat-treated (55°C for 10 minutes) cells 
were tested. Mortality was recorded after 2 and 4 days 
with the temperature maintained at 25 °C. 

LC50 cells/g diet 

Treatment; 2 days 4 days 

Untreated 5.9 x 10 5 9.8 x 10 4 

Treated 55"C 7.1 x 10 5 1.4 x 10 5 

Aedes aegypti: The larva were exposed to a series of 5 
different dilutions of cell suspension in deionised 
water. The biosassays were performed using 2 doses per 
dilution of 50 ml cell suspension in 9.5cm plastic cups 
with 25 second instar larvae per dose. Untreated and 
heat-treated (55°C or 80°C for 10 minutes) cells were 
tested. Mortality was recorded after 2 days with the 
temperature maintained at 25°C. 

LC50 cells/ml 

Treatment- 2 days 

Untreated 5.1 x 10 6 

Treated 55°c 7.4 x 10 6 

Treated 80 °C > iq 8 

Cuiex auingne> fae/jf ^f" ■ The larvae were exposed to a 
single concentration cell suspension containing 4 xlO 7 
cells/ml. The biosassays were performed using 2 50 ml 
cell suspensions in 9.5 cm plastic cups with 25 second 
instar larvae per cup. Untreated and heat-treated (55«c 
or 80°C for 10 minutes) cells were tested. Mortality was 
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recorded after 2 days with the temperature maintained at 
25°C. 



% Mortality 
Treatment 2 days 

Untreated 100 
Treated 55°C 100 
Treated 80°C 0 



10 Thus these results clearly show that cells from X. 

nematophilus are effective as an oral insecticide against 
a number of insect species (and are particularly potent 
against P .brassicae) > The insecticidal activity is not 
dependent on cell viability (i.e is largely unaffected by 

15 heating to 55°C which reduces cell viability by >99-99%) 
but is much reduced by heating to 80°C, which denatures 
most proteins . 

Example 2 - Use of X . nematophilus supernatant as an oral 
20 insecticide 



CELL GROWTH: Cultures were grown as in Example 1. 

PRODUCTION OF SUPERNATANT: Cultures were centrifuged 
25 twice at lOOOOg for 10 mins . The cell pellets were 
discarded. 

ACTIVITY OF SUPERNATANT TO INSECTS: The Bioassay was as 
follows: 

30 Activity against- neonate P. brassicae and two day old 

Pieris rapae and Plutella xylostella larvae was measured 
as for P. brassicae in Example 1, but using a series of 
untreated dilutions of supernatant in place of of cell 
supensions and with mortality being recorded after 4 days 

35 only. 
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Insect species 
P. brassicae 
P. rapae 
P ♦ . xylostella 



LC50 (fjl supernatant /g diet) 
4 days 
22 
79 
135 



10 



In addition, size-reducing activity (62% reduction in 7 
days) against Mamestra brassicae was detected in larvae 
fed on an artificial diet containing X. nematophilus 
supernatant (results not shown). 



15 



20 



25 



Thus these results clearly show that the supernatant from 
X. nematophilus culture medium is effective as an oral 
insecticide against a number of insect species, and are 
particularly potent against P. brassicae . 

The. heating of supernatants to 55°C for 10 minutes caused 
a partial loss of activity while 80°C caused complete 
loss of activity. Activity was also completely lost by 
treatment with SDS (0.1%w/v for 60 mins ) and Acetone (50% 
v/v for 60 mins) but was unaffected by Triton X-100 (0.1% 
60 mins), non-diet P40 (0.1% 60 mins), NaCl (1 M for 60 
mins) or cold storage at 4°C or -20°C for 2 weeks. All 
of these properties are consistent with a proteinaceous 
agent . 



30 



35 



The general mode of action of X. nematophilus cells and 
supernatants i.e. reduction in larval size and death 
within 2 days at high dosages, and other properties, eg. 
temperature resistence, appear to be similar suggesting a 
single agent or type of agent may be responsible for the 
oral insecticide activity activities of both cells and 
supernatants. 

Example 3 - Timescale for appearance of ingestable 
insecticidal activity 
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CELL GROWTH: 1ml of an overnight culture of X. 
nematophllus was used to inoculate an Erlenmeyer flask* 
Cells were then cultured as in Example 1. Growth was 
estimated by measuring the optical density at 600 nm. 

5 

PRODUCTION OF CELL SUSPENSION AND SUPERNATANTS : These 
were produced as in Examples 1 and 2 . 

ACTIVITY OF CELLS AND SUPERNATANTS AGAINST P. BRASSICAE: 
10 The cell suspension bioassay was carried out as in 

Example 1, but using a single dose of suspended cells 
equivalent to 50 jjI of broth/g diet and measuring 
mortality after 2 days. The cell supernatant bioassay 
was carried out as in Example 2, but using a single dose 
15 equivalent to 50 pi supernatant/g diet (i.e. more than 
twice the LC50) and measuring mortality after 2 days* 

The results are shown in Fig, 1. Thus these results 
clearly show that cells taken from X. nematophilus 

20 culture medium are highly effective as an oral 

insecticide against P. brassicae after only 5 hours, and 
supernatants are highly effective after 20 hours. 
Although some slight cell lysis was observed in the early 
stages of growth, no significant cell lysis was observed 

25 after this point demonstrating that the supernatant 

activity may be due to an authentic extracellular agent 
(as opposed to one released only after cell breakdown). 

Example 4 - Synergy between X. nematophilus cells and 
30 B.thuringiensis powder preparations 

CELL GROWTH AND SUSPENSION: X. nematophilus cells were 
grown and suspended as in Example 1. S. thuringiensis 
strain HD1 (from Bacillus Genetic Stock Centre, The Ohio 
35 state University, Columbus, Ohio 43210, USA) was 

cultured, harvested and formulated into a powder as 
described by Dulmage et al. (1970)" J. Invertebrate 
Pathology 15, 15-20. 
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ACTIVITY OF X. NEMA TOPH ILUS CELLS AND B • THURINGIENSIS 
POWDER AGAINST P. BRASSICAE : The bioassays was carried 
out using X. nematophilus and fl. thuringiensis in 
5 combination or using fl. thuringiensis cell powder alone. 
Bioassays were carried out as in Example 1 but with 
various dilutions of B. thuringiensis powder in place of 
X. nematophilus . For the combination experiment, a 
constant dose of X. nematophilus cell suspension 
10 sufficient to give 25% mortaility was also added to the 
diet. Mortality was recorded after 2 days. 



LC50 (vq Bt powder/g diet) 

Pjpassay 2 days 

15 B.t. alone 1.7 
B.t. plus X. nematophilus 0.09 

These results clearly demonstrate the synergism between 
X. nematophilus cells and B. thuringiensis powder when 
20 acting as an oral insecticide against P. brassicae . 

Example 5 - Synergy between of X . nematophilus 
supernatants and B . thuringiensis powder 

25 CELL GROWTH AND PRODUCTION OF SUPERNATANTS: X. 

nematophilus cells were grown and supernatants prepared 
as in Example 2. B . thuringiensis was grown and treated 
as in Example 4 . 

30 ACTIVITY OF X. NEMATOPHILUS SUPERNATANTS AND Bt CELL 
POWDER AGAINST P. BRASSICAE: 

The bioassays were carried out using X. nematophilus 
supernatants and B. thuringiensis in combination or using 
B. thuringiensis powder alone. The Bioassay against 
35 neonate P. brassicae and two day old Pieris rapae and 

Plutella xylostella larvae were measured as in Example 2 
but with various dilutions of B . thuringiensis in place 
of X. nematophilus . For the combination experiment, a 
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constant dose of X. nematophilus supernatant sufficient 
to give 25% mortality was also added to the diet. 
Mortality was recorded after 4 days. 



10 



diet 

Insect species 



pt alone 



LC 50 Bt powder/g) 



3t Plu& Xn 



P. brasslcae 

P. rapae 

P. xylostella 



1.4 
2.5 
7.2 



0. 12 
0.26 
0.63 



These results clearly demonstrate the synergism between 
X . nematophilus supernatants and fl. thuringiensis powder 
when acting as an oral insecticide against several insect 
15 species. The fact that both X. nematophilus cells and 

supernatants demonstrate this synergism strongly suggests 
that a single agent or type of agent is responsible for 
the demonstrated activities. 

20 Example 5 - Characterisation of insecticidal agent from 
X. nematophilus supernatant by proteolysis 

CELL GROWTH AND PRODUCTION OF SUPERNATANTS: X. 
nematophilus cells were grown and supernatants prepared 
25 as in Example 2 . 

PROTEOLYSIS OF SUPERNATANT: Culture supernatant (50ml) 
was dialysed against 0.5 M NaCl (3x11) for 48 hours at 
4°C. The volume of the supernatant in the dialysis tube 
30 was reduced five-fold by covering with polyethylene 

glycol 8000 (Sigma chemicals). Samples were removed and 
treated with either trypsin (Sigma T8253 = 10,000 
units/mg) or proteinase K (Sigma P0390 = 10 units/mg) at 
a concentration of 0.1 mg protease/ml sample for 2 hours 

35 at 30°C. 

ACTIVITY OF PROTEASE TREATED SUPERNATANT AGAINST P. 
BRASSICAE: The boassay against n onate P. brassicae 
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larvae was carried out by spreading 25 ^1 of each 
'treatment' on the artificial agar-based diet referred to 
in Example 1 in a 4 . 5 cm diameter plastic pot. Four pots 
each containing 10 larvae were used for each treatment. 
Mortalities were recorded after 1 and 2 days. Controls 
using water only, trypsin (0.1 mg/ml ) and proteinase K 
(0.1 mg/ml) were also tested in the same way. 



10 Treatment 



% Mortality 

1 day 2 days 

Untreated supernatant 50 100 

Proteinase K treated supernatant 45 

Trypsin treated supernatant 

All controls (no supernatant) 0 0 



100 

40 100 



Example 6 

gnto^ipgidal activity of ot h er Xcnnr-h^ hH,^ 

Using the methodology of Examples 1 and 2, four different 
xenorhabdus strains were tested against insect pests. 
The results obtained were as follows: 
I) Activity to Pieris brassicae 



Strain deposit 
no/code 


Cells 10 G /grm diet 
% mortality 


Supernatant LC50 
Ul/gram of diet 






0.09 


0014 


100 


0.52 


0015 


80 


3.73 


NCIMB 40886 


100 


0 . 05 



25 



It was found that entomocidal activity of cells and 
supernatant was reduced by more than 99% when all four 
strains were heated at 80°C for 10 minutes. 
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II) Activity to mosquitoes (Aedes aegyptl) 
Bacteria added at the rate of 10 7 cells/ml of water 



Strain deposit 
no/code 



Cells 10 /grin diet 
% mortality 



NCIMB 40887 



0 



0014 



40 



0015 



45 



NCIMB 40886 



95 



5 Furthermore, all strains significantly reduced the growth 
of Hellothls virescens. 

Cloning of toxin genes from strains of Xenorhabdus 
10 Total cellular DNA was isolated from NCIMB 40887 and ATCC 
19061 using a Quiagen genomic purification DNA kit. 
Cells were grown in L borth (lOg tryptone, 5g yeast 
extract and 5g NaCl per 1) at 28°C with shaking (150rpm) 
to an optical density of 1.5 A6oo • Cultures were 
15 harvested by centrif ugation at 4000xg and resuspended in 
3.5mls of buffer Bl (50mM Tris/HCl, 0.05% Tween 20, 0.5% 
Triton X-100, pH7.0) and incubated for 30 mins at 50°C. 
DNA was isolated from bacterial lysates using Quiagen 
100/G tips as per manufacturers instructions. The 
20 resulting purified DNA was stored at -20°C in TE buffer 
(lOmM Tris f ImM EDTA, pH 8.0). 

A representative DNA library was produced using total DNA 
of NCIMB 40887 and ATTC 19061 partially digested with the 

25 restriction enzyme Sau3a. Approximately 20|ig of DNA from 
each strain was incubated at 37°C with 0.25 units of the 
enzyme. At time intervals of 10, 20, 30, 45 and 60 
minutes, samples were withdrawn and heated at 65°C for 15 
minutes. To visualise the size of the DNA fragments, the 

30 samples were electrophoresed on 0.5% w/v agarose gels. 
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The DNA samples which contained the highest proportion of 
30 to 50kb fragments were combined and treated with 4 
units of shrimp alkaline phosphatase (Boehringer) for 15 
minutes at 37°C, followed by heat treatment at 65°C to 
inactivate the phosphatase. 

The size selected DNA fragments were ligated into the 
BamHl site of the cosmid vector SuperCos I (Stratagent) 
and packaged into the Escherichia coli strain XL Blue 1, 
using a Gigapack II packaging kit (Stratgene) in 
accordance with the manufacturers instructions. 



To select for cosmid clones with entomocidal activity, 
individual colonies selected on L agar plates containing 
25|ag/ml ampicillin, were grown in L broth (containing 
25(ig/ml ampicillin) overnight at 28°C. Broth cultures 
(50^1) were individually spread onto the surface of 
insect diet contained in 4.5cm diameter pots, as 
described in Example 5. To each container 10 neonate P. 
20 brassicae larvae were added. Larvae were examined after 
24, 72 and 96 hours recording mortality and size of 
surviving larvae. A total of 220 clones of NCIMB 40887 
were tested, of which two were found to cause reduction 
in larval growth and death within 72 hours. Of 370 
clones from ATTC 19061, one was found to cause larval 
death within 72 hours. 

Example 8 

Activity of cloned toxin gen es to Pieris hrassicae 
The three active clones from Example 7 were grown in L 
broth, containing 25^g/ml ampicillin, for 24 hours at 
28°C, on a rotary shaker at ISOrpm. The activity of the 
toxin clones to neonate larvae were performed by 
incorporation of whole broth cultures into insect diet, 
as described in Example 1 . 



35 
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Clone No 



strain 



LC50 (al broth/a insect diet ) 



1 



NCI MB 40887 



13.03 



2 



NCIMB 40887 



16.7 



3 



ATTC 19061 



108. 7 



Control* 



No effect at lOOjil/g 



*XL1 Blue E. coll broth 

5 

When E. coli toxin clones were heated at 80°C for 10 
minutes and added to the diet at a rate of lOOjil/g, no 
activity to larvae was detected. Highlighting the heat 
sensitivity of the toxins. 

10 

Example 9 ^ 
Sequencing of the cloned toxin from NCIMB 40887 

Cosmid DNA of the entomocidal clone 1 above from NCIMB 

15 40887 was purified using the Wizard Plus SV DNA system 
(Promega) in accordance with the manufacturers 
instructions . A partial map of the cloned fragment was 
obtained using a range of restriction enzymes EcoRl, 
BamHl, Hindlll, Sail and Sacl as shown in Figure 3. DNA 

20 sequencing was intiatiated from pUC18 and pUC19 based 

sub-clones of the cosmid, using the enzymes EcoRl , flamHl, 
Hindlll, EcoRV and PvuII. Sequence gaps were filled 
using a primer walking approach on purified cosmid DNA. 
Sequence reactions were performed using the ABI PRISM™ 

25 Dye Terminator Cycle Sequencing Ready Reaction Kit with 
AmmpliTaq DNA polymerase FS according to the 
manufacturers instructions . The samples were analysed on 
an ABI automated sequencer according to the manufacturers 
instructions. The major part of the DNA sequence for the 

30 cloned toxin fragment is shown in Figure 2 . 
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Example 10 

Restr i cti on map of cloned toxin from clone 3 
Cosmid DNA of the entomocidal clone 3 above was purified 
as described in Example 9. A restriction map- of the 
cloned fragment was obtained using the restriction 
enzymes BamHl , Hindlll, Sail and Sacl and this is shown 
in Figure 3. When compared with the map from clone 1 
(Figure 3) it is clear that over the regions which 
overlap, the restriction maps are very similar. The 
only detectable difference between the two clones was a 
reduction in size of two Hindlll fragments in clone 3, 
corresponding to the 11.4kb and 7,2kb Hindlll fragments 
in clone 1 by approximately 2Kb and 200bp respectively. 
These results indicate the overall relatedness of the DNA 
region coding for toxicity in the two bacterial strains. 

Example 11 

Southern Blot Hybridisa tion Experiments 

A 10.3kb BamHl-Sall fragment of the DNA from clone 1 was 
used as a probe to hybidise to total Hindlll digested DNA 
of the Xenorhabdus strains ATCC 19061, NCIMB 40886 and 
NCI MB 40887. Hybridisation was performed with 20ng/ml of 
DIG labelled DNA probe at 65°C for 18 hours. Filters 
were washed prior to immunological detection twice for 5 
minutes with 2 x SSC (0.3M NaCl, 30mM sodium citrate, pH 
7.0)/0.1% (w/v) sodium dodecyl sulphate at room 
temperature, and. twice for 15 minutes with 0.1 x SSC 
(15mM NaClm 1.5 mM sodium citrate, pH 7.0) plus 0*1% 
sodium dodecyl sulphate at 65°C. The probe was labelled 
and experiments performed in accordance with 
manufacturers instructions, using a non-radioactive DIG 
DNA labelling and detection kit ( Boehringer ) . The probe 
hybridised to a Hindlll fragment of approximately 8kb in 
all three strains as well as an 11.4kb fragment in NCIMB 
40887 and an approximate 9kb fragment in both NCIMB 40886 
and ATCC 19061. These results show that strains NCIMB 
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40886 and ATCC 19061 contain DNA with close homology to 
the toxin gene of clone 1 above, confirming the 
similarity between the toxins produced by the three 
strains. 
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CLAIMS 

1. An insecticidal composition adapted for oral 
administration to an insect comprising a pesticidal 
material obtainable from a Xenorhabdus species, or a 
pesticidal fragment thereof, or a pesticidal variant or 
derivative of either of these. 

10 2. a composition according to claim 1 wherein the said 
pesticidal material comprises material encoded by the 
nucleotide sequence of Figure 2 or variant or fragment 
thereof, or a sequence which hybridises with said 
sequence. 

15 

3. A composition according to claim 1 or claim 2 which 
comprises cells of Xenorhabdus. 

4. A composition as claimed in any one of the 

20 preceding claims which comprises supernatant, taken from 
cultures of cells of Xenorhabdus species. 

5. A composition according to any one of the preceding 
claims wherein the Xenorhabdus species is Xenorhabdus 

25 nematophilus . 

6. A composition according to any one of claims 1 to 4 
wherein the Xenorhabdus species is ATCC 19061, NCIMB 



40886 or NCIMB 40887. 

7. A composition as claimed in any one of the preceding 
claims which comprises a further pesticidal material not 
obtainable from Xenorhabdus. 

35 8. a composition according to claim 7 wherein the said 
further pesticidal material comprises a material 
obtainable from B. thuringiensis . 
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9 . 



A composition according to claim 8 which further 



comprises cells of B . thuringiensis . 

10. A composition according to claim 8 wherein the 
pesticidal materials obtainable from B . thuringiensis 
comprises the delta endotoxin. 

11. A composition according to any one of the preceding 
claims which further comprises an agriculturally 
acceptable carrier . 

12. A composition according to claim 10 wherein the 
carrier comprises items of insect diet. 

13. A method for killing or controlling insect pests, 
which method comprises administering to a pest or the 
environment thereof a composition according to any one of 
the preceding claims. 

14. A method as claimed in claim 12 wherein the pests 
are insects from the order Lepidoptera or Diptera. 

15. A microorganism comprising Xenorhabdus strain NCIMB 



16. A microorganism comprising Xenorhabdus strain NCIMB 
40887. 

17. A pesticidal agent which comprises a a toxin 
comprising a protein which is encoded by DNA which 
includes SEQ ID No. 1 or a variant or fragment thereof. 

18. An isolated pesticidal agent characterised in that 
it is obtainable from cultures of X. nematophilus or 
mutants thereof, has oral pesticidal activity against 
Pieris brassicae , Plerls rapae and Plutella xylostella , 
is substantially heat stabl to 55°C, is proteinaceous, 
acts synergistically with B . thuringiensis cells as an 



40886 . 
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oral pesticide, and is substantially resistant to 
proteolysis by trypsin and proteinase K. 

19. An isolated pesticidal agent as claimed in claim 18 
further characterised in that the pesticidal activity i 
substantially destroyed by treatment with sodium dodecyl 
sulphate or acetone or heating to 80°C. 

20. An isolated pesticidal agent as claimed in claim 18 
or claim 19 further characterised in that the agent is an 
extracellular protein. 

21. A recombinant DNA which encodes a pesticidal agent 
according to any one of claims 17 to 20. 

22. A recombinant DNA of claim 21 which comprises the 
sequence of Figure 2 or a variant or fragment thereof. 

23. A recombinant DNA which comprises or hybridises 
under stringent conditions with all or part of the 
sequence of Figure 2, and which encodes a pesticidal 
material. 

24. An expression vector comprising a recombinant DNA 
according to any one of claims 21 to 23. 

25. a host organism which has been transformed with an 
expression vector according to claim 24, 

30 26. A host organism as claimed in claim 25 which has been 
engineered or selected such that it also expresses other 
pesticidal proteinaceous toxicity enhancing materials 



27. A host organism comprising a nucleotide sequence 
coding for a fusion protein comprising a pesticidally 
active portion of an agent as claimed in any one of 
claims 17 to 20 in combination with other pesticidal 
proteinaceous toxicity enhancing materials. 
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28. A host organism as claimed in claim 27 wherein the 
pesticidal toxicity enhancing materials comprise delta- 
endotoxin from B. thuringiensis . 

5 

29. A host organism as claimed in any one of claims 25 to 
289 wherein the host is a plant. 

30. A host organism as claimed in any one of claims 25 to 
10 28 wherein the host is a virus pathogenic to insects. 

31. A fusion protein as expressed by a host as claimed in 
claim 27 . 

15 32. An pesticidal composition comprising one or more 
agents as claimed in any one of claims 17 to 20. 
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Fig.2. 

1 TCCACAATTG CCGGAGAAAA TCAGTCGGGA ACTGCCGGTG ATTATTCGTC ACTTATTAAA 
61 CGAATTTGCC GACCAGAATA AGGCTAAAAA ACTG CTACAG GCG CAACGCG ACTCGAACGA 

121 AG CGTTAACG GTAAAGAGTC ATTCGGATCC GCTGTATCGC TTTTGTGGTT ATCTGGTGTC 

181 TGTCAATGAT ATGACCGGAA TGAAGATGGG CAATAAAAAC ATTAGCCCAC GAGCACCGAG 

241 ATTG TACTTG TATCATGCCT ATCTCTCTTT TATGGAAG CG CACGGCTTTG AACGTCCGTT 

301 AACACTGACT AAGTTTGGTG AATCCATCCC CAAGATTATG CTGGAATACC GGAAGGAGTA 

3 61 TCGAAAAGTG CGAACCAAGA AAGG CTATTC CTATAACGTG GAATTATCGG AAGAGGCCGA 

4 21 AG AAT GG CTA CCGTCAGTGC CTGAGTGTCG AGACTTTAAA TCACCTGTAT AAAACTTTGA 
4 81 GCTTTAAGTC TGCA CTCCAT ACACAACTTA AAAT AT CTAA TTGTATTTAA AAGAAAATAA 
541 TAGATGTATA GTTATTTTTT AACTATACAT AAGCTCTACA TGCTCTTCAT TCGTGTAAAA 
601 AATGGGTGAA CAGGTGATAC AGTCAGTGAA TATCATATTA ATTACCGTAA ACCCAGATGT 
661 AGCAAGGCTT TCAGGGAATT GTGCAGAGGG TGCATAACTG AGAGGGTGAA AAAGATTTTC 
721 AGGGGGGCTT ATGGCAGGTA AACAAAATCA GAAGCAAATA CCGTGCACAA TCTGG' T TTTT 
781 ATTTTTTGGT A C T A C CTCAA ATTAAAATGA TGTAATCATC TGATTTTATT TAAGAATAGA 
841 AGTTAATCAC AATTTCATTG ATGGACTTTC ATTCACACTG GTATAGATAA ATAATTCTGT 
901 TATATCCTGT TTCATTACGC ATT CAT CAGG AGTGCTGTTA CAGGAGACAA GAATGTCACA 
961 CATCATTTAC TTGTCGTTAA AGGGCAAGAA GCAGGGTTTA ATTTCAGCGG GTTGTTCAAC 

1021 GCCTGAATCA A TTGG AAATC GCTATCAAAA AGGACGTGAA GATCAAATAC AGGTATTGAG 

1081 CCTGAATCAT TCGATGAGCC GTGACCAGAA TGTTAATCAT CAACCCGTCA GTTTTGTGAA 

114 1 ACCCATTGAT AAATCCTCTC CCCTGTTTGC TGGATGCCAG TTTTGTG CAT TACAGGACAA 

12 CI GCCAGATGGG ACAACTGGAG TTCTTTTATG AAA TCAAG CT GACCAGTGCC ACGATTGTGG 

12 61 ATATTTCCTA TAATTATCCG GCATTCAATC AA TG A TAA TG GTGCGATACC CCATGAAGTG 

13 21 GTGATGCTCG ATTATAAGTC CATTTCATGC AA C CA CAT CG CCG CAGG ACT TCGGGCTACA 
13 81 G CAT A CG CAA TTAGC CGGAA GTGAAGAAGC AAGCCGCTTT TATCTGGGGT CTCGAATGTT 
1441 AAGCCACTTA AGAAGCCGCT GGTTGAAGAA ACCCCGGTAA AACCCGCTAA ACATCATGCC 
1501 CGTTATCGTT GTGTGGATGA TGACGGCAAT CTTTTAACCG AACGCAAGTA TCGGGTTTGC 
1561 CTGCCGGATG G T CAG AT AAA AGAAGGAAAG ACTGATAAAC AAGGTTACAC CCAATGGCAT 
1621 CTTACGGATG ACAAAAATAA ACTTGAATTT CAT A 7 !"T "i AA AGGATTAATA CCATGCCAGC • 
1681 CTATACCGTT CAGACAAAAA TAGAATCCAA CGTACCTGTT GAAAACCTGC TTTACGACTT 
1741 AA C CATTT AT CGTAAGGATG CAAAAGGAAA TTTCCATATC TTGCTTGATG TTTTTCAGGA 
1801 GAAACTACAG AGTAATTATG AAACACAACA GCATATCACG CAGGAAATAG ACGACGATC r 
1861 TTCTG TG ATT TATATTATGC AAATTATG C T TCATCGCAAA CATGGCTCAA ATATATTTCC 
1921 GGCACTGCAA ACCCATTTTA AGAAAATGTA TACCCTCGGT GAATTAACTT CCGGTAAAG^ 
19 81 CTGTTCGGAG AAAAAACGGG AAAATGCCTG TTATTTTGAA AGTACAGTTG AAACAAAACC 

2 041 TGTCAGCGAC GGGGATAATA CCGTTGACTT AAA T A T CA C T ATTCCTGAAC GACCTTTTAT 

2101 TGCCAAAGAA TATCCCATTG GTCACCCACA CGATC CATTT GAAAAAAGTA AAATTGAATC 

2161 ATAAATACAG GACAGGTTAT CGAAAAGAAT TTATCCGGAT CAAAATGGAG CAAG TTTATG 

2221 TCAGGGCGCG AGCACACTAT TTTAGCTG CG TTTTTAAGAT GATTATCTCT TAATGTTCAG 

22 81 TTTTAATAGT GTTTTTATCG AGTGAAATTT AATCGCACAG GCAATTCTTT AGACTTTTAT 

2341 AGAAAACTAA AGAATTAAAG AACAAGATTG ACATTTTAAG TTCAAATATT AATCAAAGTA 

2401 TGCTCGCGCC CTGAGTTTAT GTGGCCCTGC CGCTTTTTTT TATTGCCTGC CAATAGATAG 

2461 ACCAGATATT TATG AG CAAG CGGCACGAGA ATTATGGCAA TATGG CCGAA CTAAAATTGG 

2521 T CAA CTGGAA ATTAAG C CGG GTGAGGGTTG CCGACATCCT AAAGGT A CTT TTTATAATCA 

2581 ATATGGTGAA AGAATATCTG GGTTAGATTG GCTGACATTG G CAAG C CTAA GAGATTCAGA 

2641 AAATATGATG ATGAGGTTGA TGATGAAGTA GCTGGT ATT A CAATG TGGGG AAAATTGACA 

2701 GAATGGTTTG AAAAATCAGG GTATGAAAAA GTATTTAGTA ATGTCGGCTT ATCCCATTCT 

27 61 AATATAAATG ACATAGTAAC TCTTAGTGAT TACTATAACA AAGGATATCA TGTTGTTACT 

2 821 TTGATTTCAG CAGGAATGTT ATCAGATTTT GGTGACATAG AAACATCAGG AAAAAATCAT 
2881 TGG AT AG TTT GGG AAGGAGT AGTAGAAAAC TATGAGAAAG AAAATATCAC AAATAATTCA 
2941 GATCTGAATC AATATGTAAA TTTAAATCTG TTTTCATGG-G GTAAAGTGGA ACATCAAA' r T 

3 001 AAAAAAAACA AATCACTAGA TTATGTACTC AACCATATTT TTTGAGGGTT GGTTTTTAAA 
30 61 C CAATG AAAT AACATGAAAA AAATATTAAT TATTTTTATT TTTTTACTTT ATGGTTGTGG 
3121 TAA TC CAA CG CCAAAAGTTT TACCAAAATC AG AG TTT CTT CCTGATGCAG TGATAAATGA 
3181 A C CAT ATCAG G C ATCAATT A CCATCACAGG AGGTGCATTG AATGAAAAAA GCGTTTGGGT 
3241 AAAAATTCAT CCTACTGG CT CAGGACTAAC ATGGAATCCA AAAGATAGTT CTTTC CTAT A 
3 3 01 GGGTGGAAAA AAAGAAATAA GAAAAGATTA TCATCATATA AATATAACAG GTACCCCAAA 

33 61 GAAGACAGAA TTGATAAAAA TTGAAGTGGT AGGATTTACA TTGGGTACAA TGTACGCA CG 

34 21 GAAAGAGTTC ACTATAAATT ATACTATAAA AGTAAGGGAA TAATTGTCAC TATCAGAATG 
34 81 GTGATTTAAT TCG CCATTTT TATACTTTTG TATACTCTCT CAACATAATC AGGATT"TT r 
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SEES?? ISSJSS ISSSSE ^t^at 

ATGAACTCGA AGATAATCCT TTctJaScT C^rrzl^ CTGACTTTCT GCCTGTTCTT 
AATCACCCAC ACCGGGACAT TCgSgSS S^a^S ^"5°^°° ACTCGCTTTG 
AGCAAGCCGT CCCAGATCAC r-rrr^^nS^ GGAACGGGTT TACTCATGCT TGCCAGAGGG 
GTTCACATGT SSJ S£XJ£? TcSSSK SSSSS CTGTAAlSg 
CCATGACGCC TGACATGATT GCCACTGrrr ™^ TGCCGGAGGC CGGTATCTCG 
GCGTCGAAGC CAGGCAGGGT SSS^CCT wSiSSSSS SSSSSS TCCCTGA CGT 
AAAAAACAGG GGCTCCCCTA TAAACGC CCC C^r^S^ AAACGCTGGC GAATACCCTC 
CGGAGTTTGC TGAAAAATCC GCCTTGCTGA Sff??™* TTAAAAAAAG CGCAATAAAA 
ATTACCGTCT GGTCTATTTT GAGTtcSS ™£^ GCA "GTCAGGAC 

TTACCAGACA ACGTCAGGCA GTATCACGCG SKI^r Iffi ^T* ACACGCTGTT 
GGCCAGACAA GGGACAACCG CCTCaHtS StcSJS TAGAGCCGGT 
GGGATAGAGG AAAAAATCAG AAATGGCGGT TrAr^^?S A ™ATGCGCG TATCCATCAC 

cTTcccGcrr acagcccSa S S SSiSS TTTATTCTAT 

GACTGGCGAC GTTTTATCAC CTGGACTCAG GATACAATCG SSS^* GGCCAAATAC 
TTGAAAGGTT ATGGCGACCA ATTTGCAATT MCTmS ^SISSS? A**™™* 
GTCAGTCGAG GTTTTTTCAT TTCGGGTCGT GnSlZJ^T ^? ACTTAG TAAGAATAGA 
TCTGAAAATT GCTgStCTG SScSc S?^^ GTTTGTAATC 

TGTCAACATA CTGTTAAGTT AGaS£g£ aSSS S^SSE SKSET 
AAATCACTTG G A CAATATTT TATTTCACAT rararRTT^ A^AAT ACAAAACAAT 

TCAGTTATAA CCGAATaSS SSg££ ^CAATCTGG 

ttaacgtaaa agttcataaa gaaaattatt taatt??SJ SSSS? 1^™^ 

TGTGTTTTGT TAATGAATGA ATAACCAGGT AAGCTGGAT? iI^TS^ 
TACAATATGC TATTTATTTA TATAAAGAGT TTGTG CC CA T -ZfZnnil™ AATTACTCGT 
TTCAACCGTA ACTTAGCTTC ATCGACTTTT G^CTOgSS S^^SS AACAAATTTG 
ATCCTATTTA TTTATGATAA ATAaSSJ SSSSS SSSSgS S 17 

TGTGCTCAAT CTTGGATTCA AGTATGTATT CCTTTTGGTA -rnr^SS^ - 

5221 GATGAAGAGG ATGCCAACAT GACACAATAT cSS^S? tSIa^* 3™*°°°* 
52 81 TAAATTTTAT GATTAAAATG AAATTTTAGT AGaS^cSt ISSSS£ ^ CTCAG ™ 
5341 ATAGCATCCT CTTTAATATC ATTAAT CTC A- rtT^I^T ^ITS™ 1 ^ GG CATTTACA 
5401 ATAATGACTT ACAAAATAAG SSSS S2J?£S£ iSS™ TGTGAATAGA 
5461 TTATAAAATA ATTGAGGTTA TTATGTATAG cSS??*?" ^CACTATT 
5521 CACTCGCGAC GGTCAGACGA TGACTCTTGC GGATCT^CAA AAATCAGTCC 
55 81 GAGAAAAATC TTTGATGACC AGCTCAGTTP ^-I-I^^ TATTTATCCT TCAGTGAACT 
5641 TATAGAGCAG SISIaaS mSSS?? ££&S?S SSSSS^ ATCATGAAAG 
5 701 ACAATTATCC GGTGCTATCC GACTCGGTAT KwSr*- A ,?JT?T ACCC G TGCCAACCC 
5 761 TGAAATGTTT GGTGCCCGTT CTTCT^CcS ^t^*^ ^SE™ 0 GCAGTTATGA 
5821 rrCACCGGCT GGCTATCtS SSSgS ^CGTGAACr^ ^ZJ^P 0 CTTCCA-TCTT 
— CTCTGCTTAT CATCTTGATA ATCG^Scc gSJSgS ATm ™G 
TAATATGGAT ACAGAAATTT CCAcdCTGAC ' A^SS^C S£S?SS ™ GCCAGAG 
ACCCGCAAGA CCGGAGGTGA TTCGGACGCA TCATgSSa GC???S?p ™* GCTATT 
G CCATTGATA CCCCTTACCA TCAGCCTTAC GAGACTAT~~ ^I^J^S TTACCGTCAG 
GACAGTACAC TGTCAGCGCT GTCOGTAA? cSStcA -SSSI^h I ATGAC( =CAT 
TCATTACTGG CGATTCTGGC CAATATTTCT cSSS™ I^^S ^AAGGGGCT 
ATTACGGAAA AGAACGCTGA TGCTTTATTT gSSSaIc? ^SSIIT GAGGGA AGAG 

gaaaatitcg. cgtcacaatc atggatagcc aagtatta'S ™ Tt »CGCCC 

CAAAAATACC TCGGGATGTT GCAGAATGGC tJSSaS gSSSS S^ AGGTG 
AATATCTCAA CGGGTTTAGT GGTCAATAAT GAAAGTAaS T^l^^ TTATGTGGAT 
CGTGTAAAAA CAGATGATTA TGATAAACAT GTAAATTACT SS^St ^ ATAACA 
AATAATCAAT TCTTTATATG TGCTAATTTT AAGATATCGA ^tTSS^ GTATGAAGGA 
AGGAAAAACT CAGGGACAAG TGGCATTGTC G^SScJm ^cSJS^ ^CGACTCTT 
ACTAATTTCA AAAGCAATTA CTTAAGTAAC ATATctSS aJSIJaS ^I^^' 1 ' 
AAAATATATG CCTATCGCTA TACGTCTTCC ACCAGCGC^ rl?^^ AAATGGCGTA 
TTCACTTTTG AGTCTTATCC CCTGACTATA ^CG^^2 ^^J^^t CGGCGGAATA 
TTGTGCCTGA CTAGCGGGCT tSSSS? Sa^SSa^ ctSSSS JSS^ 
GCACAAGGCA TCATCAACGA CTCCGTTCTG ACCAAAGTTT Tr-rS^S CAGTGACAAT 
CACCGTTATG CACTGAGCTT TGATGATGCA • CAGGTACT^T I^^^I GTTCTACAGT 
TATGCCCGAC GATGACAGTG TCAGTCATTT TAACrrTTTr £2? hTCGGT CATTAATCAA 
AGGGAAAATC TTTG AAG C CG gI?SSg SS" 

TACCTITGCC CGTTCAGCCC TGATGCGTGG TCTGGGGAT^ SSSSr ^^^^ 

726i SSS?SK c c ^?Sc? ?SSSSS5 ^-in SJSSS5S JJSSS 

7321 ACTGTG7ATG C7TTATGGTT SSSSf? SS&SGS SSEggSJS SSSgS 
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73 81 CGGGGAGTTG TCACGGCTGG TTATCTGGTT GTATCAGGTG ACGCAGTGGC TGACTGAGGG 

74 41 CGG AAATCAC CACTGAAGCG ATCTGGTTAT TATGTACGCC AGAGTTCAGC GGGAATATTT 
7 501 CACCGGAAAT CAGTAATCTG CTTAATACTC TCCGACCCCG TATTAGTGAA GACATGGCAC 
7 5 61 AAAGTAGTGA CCGGGAGCTT CAGGCTGAAA TTCTCGCGCC GTTTATTGCT GCAACGCTGC 
7 6 21 ATCTGGCGTC ACCAGATATG GCG CGGTATA TCCTGTTGTG GACTGATAAC CTGCGGCCGG 
7681 GCGGCCTGAA TATCGCCGGA TTTATGATGC TGGTGCTGAA AGAGACGCTG AGTGATGAGG 
7 741 AAACGACCCA ACTGGTTCAA TTCTGCCATG TAATGGCACA GTTATCGCTT TCCGTGCAGA 
7 801 CACTGCGTCT CAGTGAAGCA GAGCTTTCTG TGCTGGTCAT TTCCGATTTT GTGGTACTGG 
7 861 GTGCGAGAAG CCAACCGCCG GACAACACAA TATTGATACT CTGTTCTCAC TCTACCGATT 
7 921 CCACCAGTGG ATTAATGGG C TGGGAAATCC CGGCTCTGAC ACGCTGGATA TGCTGCGCCA 

7 9 81 AGCAGACACT CACGGGCGAC AG ACTGGG C C TCCGTGATGG GGCTGGACAT CAGTATGGTA 

8 041 ACGCAGGCCA TGGGTTCCCG CCGGCGTGAA CCAACTTCAG TGTTGGCAGG ATATCAACCC 
8101 CGTGTTGCAG TGGATACATG TGGCATCAGC ACTGCTCACT GATGCCGTCG GTTATCCGTA 
8161 CGCTGGTGAA TATCCGTTAC GTGACTGCAT TAAACAAAGC CGAGTCGAAT CTGCCTGCCT 
8 221 GGGATAAGTG GCAGACGCTG GCAGAAAATA TGGCAGCCGG ACTGAGTACA CAACAGGCTC 
8 281 AGACGCTGGC GGATTATACC GCAGAGCGCC TGAGTAACGT GTTGTG CAAT TGGTTTCTGG 
8 341 CGAATATCCA GCCAGAAGGG GTGTCCCTGC ACAGCCGGGA TGACCTGTAC AGCTATTTCC 
8401 TGATTGATAA TCAGGTCTCT TCTGCCATAA AAACCACCCG ACTGGCAGAG GCCATTGCCG 
84 61 GTATTCAGCT CTAC ATCAAC CGGGCGCTGA ACCGGATAGA GCCTAATGCC CGTGC CGATG 
8521 TGTCAACCCG CCAGTTTTTT ACCGACTGGA CGGTGAATAA CCGTTACAGC ACCTGGGGCG 
8 581 GGGTG TCGCG GCTGGTTTAT TATC CGG AAA ATTACATTGA CCCGACCCAG CGTATCGGGC 
8 641 AGACCCGGAT GATGGATGAA CTGCTGGAAG ATATCAG C CA GAGTCAGCTC AG CCGGG AC A 
8 701 CGG TGGAAG A GGCCTTTAAA ACTTACCTGA CCGCTTTGAA ACCGTGGCAG ACCTGAAAGT 
8 761 TGTCAGCGCT ATCACCGACA ACGTCAACAG CAACACCGGA CTGACCTGGT TTGTCGGCCA 
8 821 AACGCGGGAG AACCTGCCGG AATATTACTG GCGTAACGTG CATATATCAC GGATGCAGGC 
8 881 GGGTG AACTG GCCGCCGATG CCTGGAAAGA TTGGACGAAG ATTGATACAG CGGTCAACCC 

8 941 ATACAAGGAT GCAATACGTC CGGTCATATT CAGGGAACGT TTGCACCTTA TCGTGGGTAG 

9 001 AAAAAGAGGA AG TGGCG AAA AATGGTA CTG ATCCGGTGGA AACCTATGAC CGTTTTACTC 
9061 TGAAACTGGC GTTTCTGCGT CATGATGGCA GTTGGAGTGC CCCCTGGTCT TACGATATCA 
9121 CAACGCAGGT GGAGGCGGTC ACTGACAAAA AACCTGACAC TGAACGGCTG GCGCTGGCCG 
9181 CATCAGGCTT TCAGGGCGAG GATACTCTGC TGGTGTTTGT GTACAAAACC GGGGTGAGTT 
9241 ACCCGGATTT TGGCGACAAC AATAAAAATG TGG CAGG CAT GACCATTTAC GGCGATGGCT 
9 3 01 CCTTCAAAAA GATGGAGAAC ACAGCACTCA GCGTTACAGC CAACTGAAAA ATA C CTTTG A 

93 61 TATCATTCAT ACTCAAGGCA ACGACTTGGT AAGAAAGGCC AGCTATCGTT TCGCG CAGG A 
9 421 TTTTGAAGTG CCTGCCTCGT TGAATATGGG TTCTGCCATC GGTGATGATA GTCTGACGGT 

94 81 GATGGAAAAC GGGAATATTC CGCAGATAAC CAGTAAATAC TCCAGCGATA ACCTTGCTAT 
9 541 TACGCTACAT AACGCCGCTT TCACTGTCAG ATATGATGGC AGTGGCAATG TCATCAGAAA 
9601 CAAACAAATC AGCGCCATGA AACTGACGGG GTTGGATGAA AGTCCCAGTA CGGCAATGCA 
9 661 TTTATCATCG CAAATACCGT TAAACATTAT GG CGG TT ACT CTGATCTGGG GGGCCCGATC 
9721 ACCGTTTTTA TTAAAACGGA AAAACTATAT TGCATCAGTT CAA GG CCA CT TGATGAACGC 
9781 AGATTACACT AGGCGTTTGA TTCTAACACC AGTTGAAAAT AATTATTATG CCAGATTGTT 
9841 CGA3TTTCCA TTTTCTCCAA ACACAATTTT AAACACCGTT TTCACGGTTG G TAG CAAT AA 
9 901 AA C CAGTG AT TTTAAAAAGT GCAGTTATGC TGTTGATGGT AATAATTCTC AGGGCTTCCA 
9961 GATATTTAGT TCCTATCAAT CATC CGG CTG GCTGG ATATT GACACAGGTA TTAACAATAC 

10021 TGATGTCAAA ATTACGGTGG TAG CTGGCAG TAAAACCCAC ACCTTTACGG CCAGTGACCA 

10081 TATTGCTTCC TTGCCGGCAA ACAGTTTTGA TGCTATGCCG TACACCTTTA AGCCACTGGA 

10141 AATCGATGCT TCATCGTTGG CCTTTACCAA TAATATTGCT CCTCTGGATA TCGTTTTTGA 

10201 GACCAAAGCC AAAGACGGGC GAGTGCTGGG TAAGATCAAG CAAACATTAT CGGTGAAACG 

10261 GGTAAATTAT AATCCGGAAG ATATTCTGTT TCTGCGTGAA ACTCATTCGG GTGCCCAATA 

10321 TATGCAGCTC GGGGTGTATC GTATTCGTCT TAATACCCTG CTGGCTTCTC AACTGGTATC 

10381 CAGAGCAAAC ACGGGCATTG ATACTATCCT GACAATGGAA ACCCAG CGGT TACCGGAACC 

10441 TCCGTTGGGA GAAGGCTTCT TTGCCAACTT TGTTCTGCCT AAATATGACC CTGCTGAACA 

105C1 TGG CGATG AG CGGTGGTTTA AAATCCATAT CGGGAATGTT GGCGGTAACA CGGGAAGGCA 

105 61 GCCTTATTAC AG CGG AATGT TATC CG ATA C GTCGGAAACC AGTATGACAC TGTTTGTCCC 
10 621 TTATGCCGAA GGGTATTACA TGCATGAAGG TGTCAGATTG GGGGTTGGAT ACCAGAAAAT 

106 81 TACCTATGAC AACACTTGGG AATCTGCTTT CTTTTATTTT GATGAGACAA AACAGCAATT 
10741 TGTATTAATT AA CGATG CTG ATCATGATTC AGGAATGACG CAACAGGGGA TCGTGAAAAA 
10801 TATCAAGAAA TACAAAGGAT TTTTG AATGT TTCTATCGCA A CGGGCT ATT CCGCCCCGAT 
10861 GG A TTTCAAT AGTGCCAGCG CCCTCTATTA CTGGG AATGT TCTATTACAC CCCGATGATG 
10921 TGCTTCCAGC GTTTGCTACA GGAAAAACAA TTCGACGAAG CCACACAATG GATAAACTAC 
10931 GTCTATAATC CCGCCGGCTA TATCGTTAA C GGAGAAATCG CCCCCTGGAT CTGGAACTGC 
11041 CGGCCGCTGG AAGAGACACT CCTGGAATGC CAATCCGTTG GATGCCATTG ATCCGGATGC 
11101 CG7CGCACAA TATGACCCGA CACACTATAA AGTTGCCACC TTTATGCGCC TGTTGGATCA 
11161 ACTTATTCTG CGCGGCGATA TGGCCTATCG CG AACTG AC C CG CGATG CGT TGAATGAAGC 
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csssss sssss sssss sssx? asssss 

TCAACAAGAC CTTACGGCGC TAGACAACGG AGAAGGTTTP ^^5° AAGCGGGCTA 
CTCGTTGGTG GTTTGGTCCT GcSSiSS A^SSS 2SE£?S£ GCAACGCTAA 
TGCGTTTGCG CCTGGTTAAC CTCCGCCATa irrrS^I CAACCGATTA CTGGCAAACC 
GGCGAATTAC GCGAGCCTAC GATCotSS cZSZSnn ^'^ GCAAC CGTTATCGCT 
AGGGCGGTAG TGCaIScS SSSSJi? SSSSSS ^r^5 TA ^CCTTCTC 

agcgggcccg caatctcgta gcgcaatSI SSSSSS gtgatgctgg 
cagagcatga tgatgccgat gaactcacca cgttgSa?? ^z^Z? ctcag tatgg 

CGACACAGAG CATCCGTATT CAGCAACGAA rrcrr^^I tS^^S® 57 ATGGAACTGG 

tattggcaga gagccgccgc Stg^SJJ 5?ct§££ GATArrccrc 

ACGAGGATAT CAACCACGGA GAACAGCrTr rr^n^^ AAAATACCAG CAG CTGTATG 
AGTCTCTGGC CGGgSSg CTCTCAGTAG cSSS SSSKS GCGGCAGGTC 
TGTTCGGTTT CGCTTGTGGC GGCAGTCGTT GGGGGrrSr ^£T? A £T TA GTTCCAAACG 
TGATGTCGCT TTCTGCCACA GCTTCcSS SS£Sr£ r^^^ TCC GCCTCCG 
CCTACCGCCG CCGCCGTCAG GAGTGGGAaI TTCAGCGTr^ S??2£S£ C CGTTCGGAAG 
AACAAATGGA TGCCCAGCTG GAAAGCCTGA USaSSS GGTGAAGTCA 
TGGAATATCA GGAGACCCAG CAGGCCCATA CTCAGGCT^ £5^^°* CAGATGCAGG 
AATTCACAAA CAAAGCGCTT TACAGTTCGA TOmXIS^ ™ GAGCTG ™CAGCGTA 
AGTTCTTTGA CCTGACCCAG TCCTTcSc- TCAttSSSJ r^ZJZ^ ATC TATTACC 
TGACCGACAA CGGTGTTACC TTTATCCGGG rS^i GGAAGCGCTG CGCCGCGAGC 
TGATGGCGGG TGAAACGTTG CTCCTgS?- S^afT? GAACGGTACG ACTGCGGGTT 
GTGATGAGCG GGCACTgSK SSSSSa" S^cS? ZSSHSSS l^™^ 
TATCATCAGA CAACTTTAAT CTGACCGAAA AACTcIrrJI ^ZSSS™ 2 TA TCAGGCCT 
GCAACGTAGG AGCTTCCGGC AATGAATTAA ASSSSS JS^^I GAAGGGAAAG 
TGCGATTGTC TGATTTGAAA ATTTTCAG rr ^lArr-?^* *?°i£ AG i TA GAAGCCTCAG 
AGTTGAAACA AG TGAGTGTC ACCtSSSS SSrS AAGCTTTGGC AATACCCGTC 
CGGTGCTGAA TTACGGcIgC ScScS? G £ A £ CGGG 

CCCACGGCGT GAATGACAGT GGTCAATTTA TGCTGGaS? TZ*^*^nl A J TGCTCT CT 
12 961 CGTTTGAAGG TATTTCCGTG AATGACAGCG -TAr^rrJI? ^E^TIS C '-GTTATCTGC 
13021 CTGATCGACA GAAAGCGCTG CTGGaSgCC TCAGpS?at rT^IF C ^CGGATGCGA 

sssss j™£ iH£? 
ss esse ss»™ ™J S ££££ 

CGGATGGGGA. AGTGTTG AGT ATTGTGCCGC ArlrrrAA^ ^2^ GATGAG TATCTCGGGC 
CAACCTCACT GTTGGGGaS SSS?JS? JtcSSKS ^^CG 
GCGTGGCAGA AAAAATCGTT CGTTTAGAAC AcSSSr IrTZrin^ TATCAGT CCC 
AGACGTCTTT TTGGGTA CTT TTTACTGCGG iSSSSS r^™^ CGTGAGGA AG 
ATCATGCACG TATTGCTGAC SS JSSSS raSJJS 
AAACCGTCAC GCATA CCGGG GAACATATTT A CTATCACTA TOQOmSS rlrrl^Z 
ACTGTGATGA GCATGAACTT GCTCAGCATT rArr'n-rrir S^riS^ GACGATCTTG 
AGTCCACTAT GGCAATACTC AGCCGGaJU SSSr 55S? GA ^ GT TATCCTGGCA 
TGTTGATAAT GACTGGTTGT TTCATCTGGT A?SSS? SSSSE £** TATCCC 
GAACTCCGTA CCCGAATTCA ATGTCtSgA AAaSStctc SSSSS^T F^™* 07 
AAAATGGCGT TGTCGTCCGG ACAGTTTCTC CCgSJtcSS SJSSJ^ 
CCGTCGCTTG. TGTCGCCAAG TTCTGATGTT TCATCAGrrr lal^^ZS AAATrCGAA ^ 
GGTTGCAGAA GAAACACCGG CGCTGGTTtJ JS?S?aS ?^S5SJS£ CAGGGGAAAA 
CAAGGTTTCC TTGCTGCAAA CGGCCCGCAG ACTgSSS ™^^ G AC CTCAACAA 
GATGATGTCC CCG CTGGAAA TGGATTATCA AcJtcS^t GTA ^CCAGT 

gcagtccatg ccgcagttag SSHS SSS^J SISc?S- 

ATATGGAGAA GGAATTTCCG GCGTTACTTT ATCAGGArTr TGGTTGATTT 

gtgctccggt acgggatatc aSS?SIJ Si^Sc SJSotS r ™ r cc 

AACCACTGCC ACATATTCCG GCACAACAGG AAAGrrrr^r SSZZ^S£T AT G AGGAGGCGA 

acgggcgtct ggattgggtg attacggcat ^^^Icr r^?^ AC ATCAATG ^^ 

CGGAAGGTGA ATGGACACCC TTTATTCCAT TAtcSSS ^fT^ C A CCATGTCAC 

cgcaggcaaa actggctgat attS^gg l^^c tcSS??^ I^ ccatc 

CAAATAGTGT ACGTGTCTGG TCAAATAATC CGGCAGGATr ^S£^ G CTT ATCGGGC 

ttcatttgtc aaataagcca ctgJ?gJSc gJJcG?r^ ^^S^ 

TCAGTGATAT GACAGGCTCC GGG CAATCAC ATCTH^rV T^SSS TCAT ^GTCGCAT 
GCTACTGGCC GAACCTGGGG StGGAaS^ ^G^^rr ^II^ GCA ^ TA ^^^GC 
AAATTACGGG GAAACGTTTA 3S??ff?J I7t^^^ I^I^ TA ACAGGCTTCC 
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-5061 CACCACCCGA TTTTATTTAT GCCCGCAATA CTTACCTTGA ACTCTATGCC AATGAAAGCG 

IS 121 GCAATCATTC TGCTGAACCT CAGCGTATTG AT CTGCCGG A TGGGGTACGT TTTGATGATA 

15181 CTTGTCGGTT ACAAATAGCG GATACACAAG GATTAGGGAC TGCCAGCATT ATTTTGACGA 

15241 TCCCCCATAT GAAGGTGCAG CACTGGCGAT TGGATATGAC CATATTCAAG CCTTGGCTGC 

15 3 01 TGAATG CCGT CAATAACAAT ATGGGAACAG AAACCACGCT GTATTATCGC AGCTCTGCCC 

153 61 AGTTCTGG CT GGATGAGAAA TTACAGGCTT CTGAATCCGG GATGACGGTG GTCAGCTACT 
15421 TACCGTTCCC GGTGCATGTG TTGTGGCGCA CGGAAGTGCT GGATGAAATT TCCGGTAACC 

154 81 GATT GACC AG CCATTATCAT TACTCACATG GTGCCTGGGA TGGTCTGGAA CGGGAGTTTC 
15 541 GTGGTTTTGG GCGGGTGACG CAAACTGATA TTGATTCACG GGCGAGTGCG ACACAGGGGA 
15601 CACATG CTG A ACCACCGGCA CCTTCGCGCA CGGTTAATTG GTACGGCACT GGCGTACGGG 

156 61 AAGTCGATAT TCTTCTGCCC ACGGAATATT GGCAGGGGGA TCAACAGGCA TTTCCCCATT 
15721 TTACCCCACG CTTTACCCGT TATGACGAAA AATCCGGTGG TGATATGACG GTCACGCCGA 

157 81 GCGAACAGGA AGAATACTGG TTACATCGAG CCTTAAAAGG ACAACGTTTA CGCAGTGAGC 
15 841 TGTATGGGGA TGATGATTCT ATACTGGCCG GTACGCCTTA TTCAGTGGAT GAATCCCGCA 
15 9 01 CCCAAGTACG TTTGTTACCG GTGATGGTAT CGGACGTGCC TGCGGTACTG GTTTCGGTGG 

15 961 CCGAATCCCG CCAATACCGA TATGAAGGGG TTGTTACCGA TTC CACAGTG CAGCCAAAAG 

16 021 ATTGTCCTTA AATATGATGC GTTAGGATTT C CG CAGGACA ' ATCTTG AG AT TGCCTATTCG 
16081 AGACGTCCAC AGCCTGAGTT CTCGCCTTAT CCGGATACCC TGCCCGAAAC ACTTTTCACC 
16141 AG C AGTTTCG ACGAACAGCA GATGTTCCTT CGTCTGACAC GCCAGCGTTT TTCTTATCAC 
16201 CATCTGAATC ATGATGATAA TACGTGGATC ACAGGGCTTA TGGATACCTC A CG CAG TG AC 
162 61 GCACGTATTT ATCAAGCCGA TAAAGTGCCG GACGGTGGAT TTTCCCTTGA ATGGTTTTCT 
16 321 GCCACAGGTG CAGGAGCATT GTTGTTGCCT GATGCCGCAG CCGATTATCT GGGACATCAG 
16381 CGTGTAGCAT ATACCGGTCC AGAAGAGCAA C C CG C T ATT C CTCCGCTGGT GGCATACATT 
164 4 1 GAAACCGCAG AGTTTG ATG A ACGATCGTTG GCGGCTTTTG AGGAGGTGAT GGATGAGCAG 
16501 GAGCTGACAA AACAGCTGAA TGATGCGGGC TGGAATACGG CAAAAGTGCC GTTCAGTGAA 
16561 AAGACAGATT TCCATG TCTG GGTGGGACAA AAGGAATTTA CAG AATATG C CGGTGCAGAC 
16 621 GG ATT CT AT C GGCCATTGGT GCAACGGGAA ACCAAGCTTA CAGGTCAAAC GACAGTGACG 
166 81 TGGGATAGCC ATTACTGTGT TATCACCCCA ACAGAGGATG CGGCTGGCCT GCGTATGCAA 
16 741 G CGCATTA CG ATTATCGATT TATGGTTGCG GATAACACCA CAGATATCAA TGATAACTAT 
16 801 CACACCGTGA CGTTTGATGC ACTGGGGACG GTAACCAGCT TCCGTTTCTG GGGGACTGAA 

16 861 AACGGTGAAA AACAAGGATA TACCCCTGCG GAAAATGAAA CTGTCCCCTT TATTGTCCCC 
16921 ACAACGGTGG ATGATGCTCT GGCATTGAAA CCCG3CATAC CTGTTGCAGG GCTGATGGTT 
16961 TATGCCCCTC TGAGCTGGAT GGTTCAGGCC AG CTTTT CTA ATGATGGGGA G CTTTATGGA 
17041 GAG CTG AAA C CGGCTGGGAT CATCACTGAA Z-TZZTT^TC TCCTGTCGCT TGCTTTTCGC 
17101 CGCTGGCATC AAAATAACCC TGCCGCTGCC ATGCCAAAGC AAGTCAATTC ACAGAACCCA 
17161' CCCCATGTAC TGAGTGTGAT CACCGACCGC TATGATGCCG ATCCGGAACA ACAATTACGT 
17221 CAAACGTTTA CGTTTAGTGA TGGTTTTGGG CG AAA GCTT A CAAACAGCCG TACGCCATGA 

17 281 AAGTGGTGAA GCCTGGGTAC CTG ATG AG T A TGGAGCCAAT GTGGCTGAAA ATCAAGGCGC 
17 34 1 CCCTGAAACG GGCGATTACA AATTTCCCGT TGGGCAATTT CCCGGACGTA CAGAATATTA 
174 01 ACGGGAAAAG GCAAAGCCCC TGCGTTACGT TTGAAACCGT ATTCCTGAAA TAATTTGGGC 
17461 AACTATGTCA AGTTGACCAA AAAATGCCCG GC AGO AT ATG TATGCCGATA CCCATTACTA 
17521 TGATCCGTTG GGGCGTGAAT ATCAGGTTAT CACGCCAAAG GCGGGTTGCG TCGATCCTTA 
17581 TTCACTCCCT GGTTTGTGGT GAATGAAGTT GAAAATGACA CTCCCGGTGA ATGACAGCAT 
17 641 AAA GCTCAGT GATGCCTGTT CACTGAACAG ACATCACTGC ATTTAGGAAT GAATCATGAA 
17 701 GAATTTCGTT CACAGCAATA CGCCATCCGT CACCGTACTG GACAACCGTG GTCAGACAGT 
17761 ACGCGAAATA GCCTGGTATC GGCACCCCGA TACACCTCAG GTAACCGATG AACGCATCAC 
17 821 CGGTTATCAA TATGATGCTC AAGGATCTCT GACTCAGAGT ATTGATCCGC GATTTTATGA 
178 81 ACGCCAGCAG ACAGCGAGTG ACAAGAACGC CATTACACCC AATCTTATTC TCTTGTCATC 

17 941 ACTCAGTAAG AAGGCATTGC GTACGCAAAG TGTGGATGCC GGAACCCGTG TCGCCCTGCA 
18001 TGATGTTGCC GGGCGTCCCG TTTTAGCTGT CAGCGCCAAT GGCGTTAGCC GAACGTTTCA 
18061 GTATGAAAGT GATAACCTTC CGGGACGATT GCTAACGATT ACCGAGCAGG TAAAAGGAGA 
18121 GAACGCCTGT ATCACGGAGC GATTGATTTG GTCAGGAAAT ACGCCGGCAG AAAAAGGCAA 
18181 TAATTTGGCC GGCGAGTGCG TGGTCCATTA TGATCCCACC GGAATGAATC AAACCAACAG 
18241 CATATTGTTA AC CAG CAT AC CCTTGTCCAT CACACAGCAA TTAGTGAAAG ATGACAGCGA 
183 01 AGCCGATTGG CACGGTATGG ATGAATTTGG CTGGAAAAAC GCGCTGGCGC CGGAAAGCTT 
183 61 CACTTCTGTC AGCACAACGG ATGCTACCGG CACGGTATTA ACGAGTACAG ATG CTGCCGG 
18421 AAACAAGCAA CGTATCGCCT ATGATGTGGC CGGTCTGCTT CAAGGCAGTT GGTTGGCGCT 
18481 GAAGGGGAAA CAAGAACAAG TTATCGTGAA ATCCCTGACC TATTCGGCTG CCAG CCAGAA 
18541 GCTACGGGAG GAACATGGTA ACGGGATAGT GACTACATAT ACCTATGAAC CCGAGACGCA 
186 01 ACGAGTTATT GGCATAAAAA CAGAACGTCC TTCCGGTCAT GCCGCTGGGG AGAAAATTTT 
18661 ACAAAACCTG CGTTATGAAT ATGATCCTGT CGGAAATG TG CTGAAATCAA CTAATGATGC 

18 721 TGAAATTACC CGCTTTTGGC GCAACCAGAA AATTGTACCG GAAAATACTT ACACCTATGA 
18781 CAGCCTGTAC CAGCTGGTTT CCGTCACTGG G CG TG AAA TG G CG AATATTG GCCGACAAAA 
18 841 AAACCAGTTA CCCATCCCCG CTCTGATTGA TAACAATACT TATACGAATT ACTCTCGCAC 
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TTACGACTAT 
AACTATACAA 
GCGCAAGATC 
GTTCCCGGTC 
AATAGGGAAA 
G TCATTAAG A 
TTGCCAGAGC 
GTCATGACTG 
AAACCGGCGG 
AGCGGGCTGG 
TGGGGGAACC 
TATTCTGGCA 
TCGTGGACAG 
TTCCGAATGT 
CAGGGTGTCC 
GAACACCTGC 
CGAACGTTTG 
AGCGTCGCCT 
GGGG TTTTTC 
TAAA CGTTCT 
ACTATTTAAC 
AGGATTAATG 
CGCCGGACAA 
AGGGCACTAT 
GAGCTGGGTG 
GG TAATCTAT 
AGCTCTGCCA 
AGTGTCGGGA 
GTTGGTGCAG 
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GATCGTGGGG 
CGAACATGAC 
CCACTCAGGT 
AGGATCTTTT 
ATACGACGCC 
CTCATATTCA 
TGGAATGGCG 
TCGGTGAAGC ■ 
ATATCAGCAA 
AATTGGGACA 
GCCGTGTGGG 
AAGAGCGGGA 
GGCGATGGTT 
.G CAGGAATAA 
TTG CCTGGAT 
T TGA ACAAGG 
TTTTGGGTGT 
TGGGGGATCG 
GCGAACAACA 



GCCAATGCCG 
TTTAACGCCT 
TG7CATGGAT 
GTCCAGTTAA 
CAGGCTGAGC 
GCCTGTATCG 
CTGGAATTGA 
CATCATTGTA 
GGCAGGTAAC 
ATCGGTAGCA 
GTTCAAACCG 
TTT A T CTTTT 
GGTTCTGGTG 
TAAATTCAGC 
TGGTTTAATA 
TAAACAGAGT 
GAGCCAGTTG 
AGTTTGTGCC 
AGCCAGCGTC 
AATTTCCCAC 
AATACGTGTT 
AAAACGGGAA 
GGCCATATGC 
CG CTTGTTTT 
AGACTGTGCA 
GACCTTATCG 
TTCAG CCAAG 
TATCAACTGG 
CATCACTGCA 
CATGGACGGA 
CAGGTTATGG 
TT7ATTAATT 
ATGCAGCGCG 
TGGCAGCTTC 
CACCCAATAA 



GCTCCTGTTC 
AGCTCTTCGA 
GCTTTAGCCG 
AGTACGCTGG 
CAGGCGCAAT 
AA CGGGCAGC 
GGGATGGCCC 
TTTCCCACGC 
GAAATATTTC 
CCATTGGCGG 
CTAGCCGGGT 
CTG CACGTCA 
GACAAGGTGG 
TTTTTGGATC 
AATAAG CTTT 
GCCACAGGAA 
CCACTGTCAT 
CTGCCGCCAT 
GTCACACTGA 
ATATTCAGAT 
TTAAGCGTTG 
AAAATGAAAC 
ACCTCCAGTG 
ATCAGGGTTT 
AAGTGTGCTG 
GAGACCGCCA 
TATAGCGCTG 
TGCTGAGTTT 
GCTAATTGAG 
TCTTGCCGAC 
GCTGACGCAG 
CCTCCCACAG 
AGGGCTGTGC 
TCACCAGCGT 
CCATGACGGC 
TGCATTTTAA 
GCTTCTGCAT 
CTTACCCCCC 
TGAGGTAAAT 
TCTGCGGGCG 
CGTAAGTTAT 
TGAGGGAGGA 
CTGACGCAGT 
TTCCAGCCGT 



GAATCTGACC 
CGTTTCAGAT 
GGATATGTTG 
CTGGACACCC 
TGATCAGGAA 
GAAGACAGGT 
CACGACATAT 
GGGTCAGGCA 
TGATCAGCTG 
GTGACGGGCA 
CACCCGAAAT 
TGCAACAGGG 
GAGTGTAGAT 
CCCCATCGTT 
AGGGAAAAAA 
CGCTTCCTTT 
GGGGGTACAA 
TCGGGGCTGC 
TCTCAGAAAA 
AGGTAGGCGC 
CAGGTAC CGC 
GAGAACATAA 
ATACGCTCAG 
TATTGGCGGC 
GATTGGTGCT 
TTA7CGGTTT 
TGTCAGTTCC 
TGAAGTATTA 
GACAGCCGCG 
TACCTGGAGC 
TAATGAATC 



AGAATCGCAT 
CACAGCAACC 
TTCACCCCCG 
CGTGACGAAT 
TTCTACCGTT 
AACAGTGAGC 
AGCGGCAATA 
CAAGTGCGGG 
CG CTACAGTT 
GATCATTAGT 
CAGTCAGAAG 
TTG T ATT ACT 
CCTG CCGGTG 
TTTTCTGATT 
G CGTATCG AA 
G ATA CGTTCT 
GTCTGGGGGT 
CATTGGTGGT 
AA TT GGGG AA 
TTTTGTTGTC 
CATTTCCGCA 
CACGGGCATG 
GCCCGGTAAT 
ATATTACTTG 
ATGTATGGTG 
ATCGGCAGGT 
AGGAGCTGGT 



AGTGAGCGCC 



GTTTTTCGGA 
AAG AACG AA T 
TCTGTTTA CC 
GCCCTTCAAA 
TCATG CCATG 
AACTCAGTAT 
TTTGTTTGAT 
CCGATAATTT 
TGCCTGCACT 
TATTTTCTGT 
CCCGTTCATC 
CACCCGCTAA 
CCG CATTATT 
AATCATAAAA 
CATTACTGAA 
CCAGATAGTT 
CATCAATTTG 
GGCGACGGTA 
AAATTTCGAT 
CAAAACCGTA 
CGCTGGTGCT 
TAACATCTTC 
TTTCTTGAAG 
TACTTTGCAG 
CCTGCCGTTC 
ACTTGGCATT 
CGCCGCCGCC 
TGGCATAGAG 
AGAGGCGTTG 
ACAATGCGGT 
TGCAGCATTT 
GGCTCTGACC 
TGTACATACC 



TTACCTTATA 
GCCGCTCATC 
GGCTTTAAGC 
GAAGCATAAC 
TGTGTGGACA 
GGTGTAACGG 
- ATA CCG 
i «-?G CAGGTAC 
TGAGATCACA 
TGCCCGGACA 
ACGGCGTGTA. 
GAGGCTGGCT 
GCCTTCACCT 
CAGACCAGCA 
TTTTTCCAAA 
TAAACCCGCA 
CAATTCATAC 
CTGATAATAA 
TTTACTTTGC 
TTTTTGTAAT 
TT TTA TCTCA 
TATTTCTGAT 
ACCAATCGCA 
AATATTGGGG 
CAAGACCGAT 
GTCGTACAGC 
CGCCAATTTA 
GGCTAACTCA 
AGTAATGCTG 
TTCCAGAATC 
TTGTGAAGCA 
AGATAATGAC 
CGTCAATGTC 
TAACGAAATT 
TATGTTGATA 
AATCGTTATC 



.CATTTTAGC 



AATTCACGAT 
GGG CTG TACT 
GCGGG CATCA 
TGCAACAAGT 
ATGATGCAGA 
AAATACAGCG 
CATTAAAAGA 
TG CTG CATTG 
ATGGCAACCT 
CAGGAAGAAT 
CTGATTACAC 
ACGGCTATCG 
AGGCCGATGG 
CTGA7GGTCG 
AGGCAGTCAA 
TGAAATTAAA 
GAAGCGGCCA 
TTTGT CTCCG 
GTTTTAAGTT 
ACA TCGCTTG 
GCAACAGCGG 
GCTATCAGTA 
GTCAGCGCGC 
GCCGCCATCA 
CTCGATGGGG 
TACTGCTCAG 
TTGGCCGAAT 
GCCGTACACC 
ATGCCGTTGG 
GGG CTTTT AA 
AATCATGTTC 
GAGACCCGTA 
ATATGCAAAA 
GGAAAACTGA 
TTAGCATCAT 
ATCGCTTTGC 
TCCTGATAAG 
TTACCTAAAC 
TGCAGTTGTG 
G CATTG ACTA 
TACACTTCAG 
TAGCTTTTTT 
TAAGTCCCAT 
TGATAAGTTT 
ATAGCGGACA 
AGAAAGGCTA 
ACTGCCGCTT 
GCTTCCGCAT 
TGGCTGATTT 
CTGGCATTGA 
ACGAGATCTG 
GAAGAGAGGT 
GTATTGAAAC 
TCAGCATCAA 
CTGCCTTGAG 
AGCAGGGTAT 
ACCGGAAAAC 
GTGATGGCAG 
AGTGGCTGAC 
TGCCAGTAAC 
TGCCGTACGT 
ATGATGCCGC 
CAATGAAAAA 
TTCGTTTAAG 



CACCGGTAAT 
GG AAG AG CTG 
GACCCGGCTT 
GATATTGGTC 
CAGTCAGCGT 
AACATTATAT 
GTTTTTG CAG 
GGAAACAGGC 
GATTGGCAGT 
AITACCCCTA 
AAGCCGGCGT 
TTATTATCAA 
TCTCAATTTG 
TTTCCCCGGT 
CATCACGACA 
CCGAGGATTG 
CG A TTG CA GG 
GGGCGG TGAT 
ATCTG ACGCG 
TGACGTCTGC 
TCACCGTTGG 
TTGCCACACC 
CAG AG CGGTT 
GGG AAGTTCT 
AAGGATCATT 
AAGAGGCATT 
GATAGGAGAA 
CGGTGAATGG 
AGGGGAAGTT 
TAACTTCTTC 
ATTCCCACTT 
CAGGGTCTCT 
TGATATCGCT 
GGG TTAATGT 
TGAAATCCAT 
AG CCA CGTGG 
GCCCTAAAAG 
CGTCAGGATA 
TCCCTTCGAC 
ACTCAGTCAC 
C CAG AG AAA C 
CCATCTGTGC 
GCCAAGCACC 
GCTCTGCCAT 
ACGTTCCACG 
ACTGCGCCTG 
CACGACGTAC 
TATTGCGCTG 
TGTCTGCGGC 
AAAGCGCCCC 
CCGCGGCGGC 
AAAGATCCAT 
TGTCAAAACG 
TTTCAGCCAT 
TTTGCAGTAT 
TGCCAAATTG 
GGTACATCGG 
CACTGAGTAA 
CGTCGATTGT 
CTTGCAGTTT 
TTCG TGGGTA 
ATTGTTTGGC 
TAAGGCTCAT 



GTATCACGTT 
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CAAGCTGGCG ATAGGCGCTA TCTCCGCGGG TAATCAACAA ATCCAGCATT TTCATAAAGG 
TAGCCACTTT ATAGTGCATC GGATCATGCT GGGCAACGGC GTCCGGATCG ACCGAATCCA 
GCGGATTGGC ATTCCAGGAC GTATCTTCCT CCAATGGGCG GACGTTCCAG TAATAATCCT 
G CATTTCAC C CTGAACCGAA TATCCGGTCG GGTTCAGATA TAGCGCAGCC AGCGTGTCGA 
TCCGGTAAAA TCTGCTCTTG CAATAAGCGC TGGAATACCA TCATGGGCGT TGTAATAGAA 
CAATCCCAAG AAATAGATTG CATTGGCGCC GTTTGAAATC CATGGGTTCA GTGTTATTTT 
TCATGACACG A CTTGAATAC CCCTTTTATA TTTTTTGATA TTTTTTACTA TCCCCTGTTG 
TGTCATTCCC GAATCATGAT CGGCATC ATT AGTGAATATA AATTGATTTT TCGTCTCATC 
AAAATAAAAG AAAGCAGATT CCCAGGATTT GTCATAGATA ATTTTTTTGT ACCCAACCCC 
TAATCTGACA CCTTCACGTA TGTAATATCC TTT AG CATAG GGAACAAAGA GCGTTACTGT 
GGTTTCAATA TCAGATAACA TTCCTTCGTA ATAAGGTTGT CTGGCAGAAT TGCCATCAAT 
ATTCCCAATA TGGATCTTAA ACCAACGTTC ATCACCATGC TCCTCTTTAT TGTAGGGGGG 
CAACTTAAAT GTCGCATAAA ACCCTTCACC TAATTGCGGC TCTGGTAAAT TTTGCGTTTC 
CATACTTAAA ACATTATCAA TACCAATATT GGCTCTTTCA GCTAATTTTC TGGAAAATAA 
AG TATTTAAC CGGGTTCTGT AAGGGCCAAT CTGCATATAT TGTGTGCCTG ATGGCATTTT 
ATGCAGTGAT ATAACGTTAC TTGTATCTTT GGATTTTAGT TTTATATGAA TTGGCGATTC 
AATAACAATA TCG TTATAAC CGCCGTCGGG TTG CTTAAT A ATAAACTCGC TCACCAGAGG 
AATATCATAG CCTTCAATAT CAACTTTTAC TTGATTAAAA TCATATACCA TAGGGTCAGA 
TTCGTGTGAA GG TTTAGATG CCACATGGTC TTCAGCATTT AACTCCACTA GAATATCAGA 
GCCATTTTTT AATAAAAAAC TAATGTTTTT ATCTTGGATC TGTTCGATCA TAG ATG AAG C 
AAGTTTTATT ATCTGTGGCT GGTTGAACAT AAATACACCC ATGGATCCTC GCGAAGGAAC 
AGTGCCGCAA TATTTCCCAT GTTATTAATG A TTG AAA CAT CA TT AG T AAA TGATTCACAT 
ATAGTATGCC ATACTCCTGT GTTATCTTTC CAATCTAATA CTATGTTAGT ATCAAGTTTG 
AATTCAGCAT CATCTGATTC ATAATCATAA TTT AT A C CAA CTCCAATTTC TGATTTTCTA 
GGAATTTTTT CCTTGGTTCT TAGATGCATT AACACTCTAA AATATTCGGC ATTTTTAAGA 
TCGATGGAAA TAATAAAATC CAAAGTTCCA TAATGAAAAA CTTCTTCTTC TTTTCCAAGC 
ATTTCATCAT GTCTATCATA ATCAAATAAA ATAACCGTTT CATCTTCTAC CATCGATAAC 
AGGTATTTAA CCTCATCATT ATATATATTG C CTTTTG AAA AATTAATTTC GATTGAAGGA z% 
TTGAACGTTA AATTAATATG ACCATTTCCT GGTGATATAT ACGAGAGATC AAAAATATTT £ 
CCGGTAAAAC TGG CTAATTT ATTTTTTGTG GTTATAGATT CCTTATATTC GGCCAAATAA \J 
TCT3TAGCAA ATTGATTGTT GACTTTGTAT TCTGTCCTGG TATCAAGTTC TGATAATGTG 
CTCTTAACAA TGGCGTCTAA ATCATTTTCT GTGAGAATGG ATAATGTCAT ATCAGGG TT A W 
ATG GTCATCC CTTCTCTTGC AGGAAGACTA TTAAAAGAAT AATTGTCTTT TTTCTCATGG 
AAATAAACAA TAATGACGTC TTTTTCATAA TCAGAAGAAC AATACATACC AATGCTGGCT ■ 
TTTTTATTGA TCAGGTTTTC TATTTTATCA GTCA CATTAA AATT AAA CGG TGAGCTCCAG 

.CTGCCAT CAT AACGAATATG TGACAGTTTT AATATATAAT CAGTGATATC TATCTTGCCA '|j 
TCTTCACTTT CATTTTTCAG CTCTTTTTGT TCCAGCCACA GTAAATACAA ACGAGACTTG % 
TAAATAACAG GTCTGATATT TTCCTGCCAT ACATTGATGG GTATTTCAAT TTTTTTCCAT J> 
TCTCCCCAGG CATTGGCAGC AAATTG AC CG TGCTGGCACT TTTGGTGATC GACATTGCGC 
CAATAATATA TTCTGGGTTC TGTCTGGCTA TAACCAATTA AATAAGTGAG CCCCTCATTG * v 
ACATTAATAC TGTCATGATA TCCGCTAATC ACCTGCAAGT TAG CG ACATC TTCAAATGCG 

.- GTCAGATAAT TTTTAAAGCT ATCTTCAACG GTATCGATAT TT AA CTG ACT TTGGGAAAGT 
TG CTGTAACA GGTTGTTCAT CATACCTGTC TGACCAATAC GAATCGTGGG GTCGATATAG 
TTTTC CGG AT AATAGGCCAG TTCAGATACG CCGGCCCAGG TGCTATACCG TCG ATTG TAG 
GTTTCCCAGT CGCAGAAGAA CTGACGGGTT TTCACTGGCT TTGATACTTT TCCTTCAACA 
TTATTCAACG CCCGGTTGAC ATATAACTGA ATGCTGGCAA TGGCTTCTGC CACACGGGTG 
GTTTTCACTT GGGCAGAAAC TTGGTTATCA ATCAGCAGAT AGCTGTACAA CTCATCCCGG 
CTCTTAATCT GTTGAGGTGC ACCATTTTTG ATGTAGTAAG CACTGGCCGC TGTCGTCGTG 
GCTTCATCCA GCCATGCCTG AAGCTGGTCG G ATTG TTG AC TGTTCAGTCC CGCCTGCAAC 
AAAGTACTGG CGGCTTGCCA ATCATCAAAT GTTGG CATCG GGG TTTCCGG TTCACCGACA 
TATTTTAATT TTATGAGTGC AGCAACACCA TCCGGGGTAA TACCCAATGT AGCAGCGACA 
TCCAGCCATT GCAGAGTGAC ATCTATAAGT TCTCCAGTTG GTAAAGGTAT TCACTCCCAA 
ACCGGTCTGT TGCAATGCTT GTGTCACAAC CTG AG CATC A AAATTTTAAC GCCACCGCCA 
AATTGTTCGG CAGTCAACGC TCCTAAGTTC C AAATG CTG T TAAGATTCTG TCGCGTAGCT 
TCACAACGCA TGATCACAGC ATGGAAGCGG GTCAGCGCTT GCAAAGTGGG GAGATCATGT 
TGCAGTGCTG TGGTTTCTGA TTGGAATTTC TCCGGTTTTG TCACCAACAG GGTCAGTTCG 
TTTTCG CTG A GTCCAATATT GCGCACAATC AGAGAAAGTT GCCCCAGTAC CTGACAAAAA 
GCCACCATGT TGCTGGTTTC ATTCTCTGAG CGATCACGGT TAGCCGCAAT AATCATGAAA 
TCATCGAATG TCAGTCCTTG TGGTTTTATC TGATTAATCC ACAGCAAAAT AGTTTCTGCT 
GTTTTGGCTG AATCCATTTG AATGCTGGCA GCAAT CAGCG GGG CAG CTG C ACGGATCAGT 
TCGTCATCAC CGAGTGAAAG TGTTGATAAT CCATTACTTA GTGTCGTGAT AAGGTTTTCA 
ATATCCGGCG TAAGGACAGT G CTG T AATT A TCCGTGGTCA TCAGAAACAC ATCACTGACA 
GACCATTTCT GTGTTGTCAG CCACTGGGTG CATTGGAACA GAAAGCTGAT TAATTGCGTT 
AATG CTGTAT CAGAAAAAAG GGCAATTTTC GTGTTCACAT AGGGAGAAAC CG A CAA CAA C 
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iSSSS SSSSSS ?SK£S SS*** «««™ 

CATAATGTAT ACAGTTCAGT ATCATTCACT TTCCGCCATT AGCCAGTTTC 

AAATGGTTTT TTGATTCACC GGGGGTTAAA TCa££5£ SI^? 1 "^ TC CCAAATAA 
TGGCCATTTA ATAGCGGTGT ATTGAACAGC AT^taTT^ JATTATCAGC AGAAAACTCT 
GAATATTGGC TCATATCTCA ATGACACAAT GACTGGGTTG TTGTTTAGTG 

TGCTGCATAT AATATTGAAC ATAaSSS I ScSSI AATATTATAG 
TCATCATAAA TACTTTCTAT TACTTGCCAG ATATCTTCT? S^SS™ AATCGTTAAG 
TACAAACG AA TCGCTTTATT CAGCTTTAAC A^aarT™ 0 . GAGATATGCC TGTCGCTCTA 
TAAAGTGTGC ATTGgStTC ATAGCaTccS i^SS^J CACCGGGAAC TCCATCATTT 
TGTTATACCG TTGOtS^ SSgJJg? SS SiSESS ATAAGCGGAG 
AGCAATGGGG ACGAAATTTT TATCTTGGTA ™»w£S£5 SS^™ 07 ^ 7 AATGGGTATT 
AAAATCCAAG TGGTCAGGTT CTGTTTTTTT SSSSaT ^EJ2£ T TCT GGAGACG 
TTTGATTGGA ATTAG CTCTG CATAGTTTAA ATCTffl2S£ ^T ATTTGT ATTCATTTTC 
CTTAATCAAT CTTGCCGTTG CCGTATCAT? J^GTrSS ^ AGAAATCTT TGCGGGTTCG 
ATTCTTATAC TGTTGATTTG TATTTTTCTT A-rrlS^ ACCAATGTTA TCAGTTGCTC 
27541 TTCATCATAA GACAAATCG? AgSSSc ^JiS^ AGArr SACAA ATAAACTGAG 
27601 ATCATCTGTA CCGAAATTTT SS SJJ^S ^SI^^ AAATCAGTAC 
27661 TACAAGGATT TGATACAATT CAGgSIS? ScaI^S^ SaT^P TAA ^CTTC 
GTCCATTAAT TCCGCTACGT CTGTATTACG GCT^ATCCfe GCGATC TTGG 
TAAAATTGCC TGACGGGCTG ACTCATACGG caSJSSS SS^IZ^ 1^™^ 
GTAAGTGGAC AACATTTTCA TTACACCG TT ATAGTCmJ? CGGTTTGCCG 
ATGCAGCAGT AATTCATTAG ATAAGGA^AA TP T^T 1 Ah a -T^r T^-?! TCTGAATATT 
TGTCAGTGCC AGTGAAG CAA TGTCGGGG CT ^r^^Z JCTTCATCCA TATTATTCTG 
AGGATGAAAA TCTTTCGCTT CcSKSSS AGG J GATA<rr GAGAATTGTC 

GGAAGCAATT GATCCCGGTT TTACAAAACG ^AAGCCGCTG GTGAAAATAT 

ACTATTGTTT AGGGTTGACG GTGTAATATT AAGGTTAr?^ ^^ AACC AACTGTTGTA 
AGCACGGGAC AAAATGCGCA SgSJSXJ ^Sc?S ™55S* GTTGTGGA " 
TTGATATAAA AAGTCTGTTT CTCGCCACGT SgaCT^-I EZ? A TIH CT GATGAGCCTG 

gctgaaagac ataaacgaaa SSSSSI SSgu Sc£5£ gac , gaaa tc 

ATCTTATCTA ACAGTTCATT AACTTTTATr *^TaYaSU£ 13 CAGCCT TTITCTATTr 

tgattaatgg rrrrSSS ££SJ£a ?aa?c^I?a S^S^F" gtcaa t™ 

ATTGATACTG ATTTATCG CT CTATTC^TTC aarlf^>?i GGAATATTAT GGTTAATTAA 

atggatttaa ataatgSS SSSSS aaaattaaat ^SSIr S2 ATAATAC 

AAATTCAACT CAACAATTGT TTAAATATTT ?TAaSgTGT ZPCATGAAA 
ATGACTAATA TTTATCTATG AAAGATTAT^ ~»wT2£fI H GTGCTGTT TGAAAAATGA 
GCTACGTTGG AGTCaSJS A^SSSI AEEKScJ «™»»GG 
CAAAAGTTGG TATATCGTGA CAAGAGTGAT ACTAATCtS TGC GTAATTA 
AACCTCGCAA ATGCGGGGTT TTTCTTCGCA rpl^l^, X2S?£ TCAATA CCCG 
TGATTACTCT TATTCTCAGT ACCCTTTCTT ^J^S GAAAGCTATG AAAAAAACAC 
TTTCCCCGGA CAGCAcSS gTSSSI S? 36 ™ 

CCAGCGTTGC TCAAGCAAAA TCTTTTCGTG ATGATGCGTG SSS^SJ CCCAACCTGA 
TTGTTAAACA GGTTGGTCAC GAACTCTATG AATTrrnS? ™JJE TTCTG GAAGGAAACA 
AGGGATCGCT TATTACGGAC T^SSS GACTCACTAT 
AAAACAGAAT TCAGGGATAA SStSgTTCT G^ATC^ A^S^T ACGCCTGAAT 
ATGGGTCTGA CGGCCACTCC AACTGACAAA G^TCni^rr ACA II? A T GA TAAGCGCTGG 
TGGAACAGTG TTGAAATTGA TGTCAAAACT raSSES ^a^I ^GACAAAGAC 
GAATATAGCC CCGCACTCGC GGGGTTTTTT rrmSS^ F?^^ 01 CAAGCACTTT 
29461 TGACGAGGAT CAAAACTAAG SXSSS JSSSS ™^ AAGT "^CCGTAG 
29521 AAGTTAAAAA TraaAfinT-PR iii^ir™ TGGTCACTGA TTTGGTGCAT AAGTTATCAA 

29581 ACgSgA^ ISJaSt aSS JaS^S 5 

CCTTTAAACT ATATTCGGGG GAAATTATTA TC^r^I^^ ^15^^ AATTGAAAAA 
ATAACAATTT TGGTTGTGAA TATAAaS 1^^?? JCGTAATATT ATTAATCTTG 
TACACCCATT TTTCTCATCC rrrr¥££S A ZH ATTTAA ATAAGTTTTC ATAATTGTGA 
TTTGACATGG SSSSIct SSSS5 SSSISr r^^^ CCATCAaSS 
ATGCAAGGAT TGCCATAGAC GTTCAATTTT IT^^r^ J?GTTTCGTG TCACGGTTTC 
GAGAAGATTA AATTTGGGAT TCTTTGCCAG cSS^C^ f^^l^ TCGG TAAAAA 
GCAATAGTTA TCTAAAATTA ACGTGATGGT S^^^ A H CTTCCGGC TCTTATCAAT 
ATCTAACAAT TTGATAAATA AaJSJIgS S^JSSr ^ A P 5AT TGTTAAriTC 
TTTCGTTTTC GCGTTGAGGC AaSSS StcS^ ^TGATTTC 
AACACGCTTT TGTTGCCCTT TGAAGCACCA 



27721 

27761 

27841 

27901 

27961 

28021 

28081 

28141 

28201 

28261 

28321 

28381 

28441 

28501 

28561 

28621 

28681 

28741 

28801 

28861 

28921 

28981 

29041 

29101 

29161 

29221 

29281 

29341 

29401 



29641 
29701 
29761 
29821 
29881 
29941 
30001 
30061 
30121 



30181 AACACGCTTT TGTTGCCCTT TO^G^ GTctJSS? C *^A^ 

30241 gtccacctca tcctcataga agaccgggS ^??^Sp? ^inS GGGT tcaggttcat 

30301 TGATTTTTGC CATTTTTrCA TCATActSg G^SS^ ^TP 0 ^ AACGTCTCGC 
"™ =A TC CCC=TC SSoggSS SSSS gSSggggg 
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TATTCAGTAG CTCATTGATT TTAAGTGTAA TAAGCTCAAG GCTCCATCGT GAACGGAGAT 
AG CCAAAATG TTGTGGCGAG TG CTGTAAT A AGAAAGAAAT GACTGTGAAG AGCGGAGCTA 
AGTTCCAGAT GGCAGGCCTT CCCGCCGGGA GG CTTTTAAG TCCTTCCAAC CCGTATAATG 
TTAACCAATT TACCCAACGA TGAACGGAAG AACGTGAACA GTGAAGCGTT CTGGAAACGT 
GAGAAACCGT ACTCCCTTCA TGTAACATCA AGAGCGCGGT GAAGCGACGT GCATAGTCCT 
TATCCCGGGT TTTCTGGATA GCTTTTTTCA TCGGACGTCG TTCATTTCGG GGTATTGATG 
TTATGATTGG CATGACTCAG TCCATTTTGG GATTTGTTTT GATTTGGCGA TTAATCAGAT 
CGCGAAAATC GGACTGAGTT CCCTTCAAGT GATCTACTAT TTTGAAATCT TATTTAATCA 
GGAGTCAGCA AATGAGTTAT TCCCCATAAT ACCTGACCAT GTGGTTGTTT ATCCGGGAAA 
TGATTCATCT ACCGGTGGTA TGTGGATTCC TTGGTGCGAT AGTCAGAAAG ATATTGACTC 
TGGCCATTAT ATCAAAGTTA CTTTCAGTAA AAAGGACGCT GCTGATATTG TGAACTACAT 
GTTTCAACAT GG CAGTTATG TTTATTTTAC AGACAGTAGT AAACAATTTA GCAATAAGCA 
AATTATGTCT GGTGATTCAG CTAAAGGCAA AGGGGATTAT AAGCTTGAAA TTAAAACAAA 
CGGGAACCTT CCACTGATGG TATTGAATAA ATATTGATTC ATTATTATTT ATGGATAAGA 
AATTAAGTTT ATATTTCATC TGGTTTCTGC AATTAAGTTT TAAAAATTAA TTCTACTTTT 
TTTATGGTTT TATATTTAAT G C CAATC ATA TTATTTTTCT TATAATAATT GATAGTTTAT 
TTATATAGTA AATAAATTCT GTTGGATGTG ATTATTATTG TGAGACGGTA ATAATTAACA 
TAACAGAAAA TTCATGGTTA GGAAATTCAA TCAACTTTTG TCCGGTTTCC TGACCATGAA 
GAGCTGTATT TACTGTAGAA CTCGCATTGA TA CTGG ATTG ATTAGCCGGA CGAGTGTTGG 
GTCAGCAGAT AATATGTTGT ATATTGGCTG TGGATTTTTC AGCGAGATGA TAGCTTTGGC 
AGTAAAGGCG ATTAATAACC GATAAAACAG AGAGACGGAT TGTGGCCAGG AAAGCAAAAA 
AGCCTCACCA TGACGCGTTA TTCAAACATT TTTTAACCCA ACCAGAAACC GCCCGGGAAT 
TTTTATCCCT TTATCTGCCG GAAGCGATCC GGTCAGTGTG TGATTTACCA CACTAAAACT 
GGAACCGGCA GCTTTGTGGA CAGGCAATTA CGTCAGTTGC ACAGTGATGT GCTGTATTCT 
GTCGAGACAA CCCACGGGGA CGG TTACATT TATTGCCTGA TTGAACACCA GTCCACGCCT 
GATCCGTTAA TGGCCTGGCG GCTGATGTAT TATTCGCTGT CAGCCATGGC TGCGCATCTG 
AAAAAAGGAC ATACTGAACT CCCTTTGGTC GTCCCCCTGC TGTTTTATCA TGGTGAGGTG 
AGGCCTTACC CTTACTCAAA TCGATGGCTG GATTGTTTTA CACTCTCTGA 'ACACGCGGCT 
CAC CTGTATA ATCAGCCCCT GCCGTTGGTG GATATCAGTG CGCTCAGTGA TGAAGAGATC 
CTGACACATA AAAGCATTGC CTTGATGGAG C7GGTACAAA AACATATCCG TTG CCGGG AT 
ATGCTGGAGT GGGTTCCCCA ATTGGTGGCG TTGTTGAATG CCGGTTATAA ~TAGCGCCGAA 
CAGCGCCATG TTGTGTTAAG CTATATTTTA CTGAATGGAC ATACGCTGGA TCTCGCCCAG 
TTTGTCCATC AACTGACTGA ACAATCTCCG GAGCATGAAA CCATGTTGAT G ACT ATTG C A 
GAA CAGCTTG AACAAAAAGG GCGTGAGCAA GGCCGGACAG AAGGCAGAAC AGAAGGCAGA 
GCTGAAGGAC GGGAAGAAGG CAAGCTGGAA ACGGCGCGCG CATTATTACG GCATGGTGTC 
AGTCTGGACA TCATTGTCAC CAGTACCGGC CTG AG CCGGG AGAAAATTGA AGCGTTAAAG 
CATTAAATGG ATACG CTTTT TCACAGCAGG ATATGGTGAC CCCTGTGAGG CCACCGG AAA 
ATTTTATTTA CTACGATTTA CGACGGGTTA CTTTAGGAAG CTGAATGAGA y CGTCCTTTGT 
TATATAACGG TCCCATATCA ATCTTCTCTT TTCCGCGTAC AGGTAAGTAA ^CCCAAACCTT 
CGTGAGCAGC ATTTGCCAAC AGGCCATCAT CCTGATCGCC TGACCAAGAG \aGATCCCGC 
CCAATTTCAT TTTGGTTGCA TAAATTCCCT TATGCAGCAC AGTGCGGGGC GTATCCAGTG 
AAATCCAGTG ACCACCGTCA GCATTAAAGA GTGCGTCAGC GTCGGTTTCC GTGTCTGTCA 
CCAGTTCAAA CTGATTTTTC CCGCGTGCAA TTTCATATTC CGCATCGTAT TGGTTATTCA 
GCAGACAGAA GAATTCCGGA GCACCTTTTT CCATCGTGCC CAGTGGCTCT CCTGTTCTGT 
TATAGCGGCG CGTTGTCAGA TCAGCACCCA GACATGAACG TCCATAGTTA GCAAATCCGA 
GGTGAATTTT CTCCGGTTGT ACACCTTGTG ACAGTAAAAA GCGGATCGCC TCATCTGCCG 
AGTAATCCAT GTCCCGATCA GGATTGGGCG GAGGAGGGTT ATCGCCGTCA TATT CATATC 
TGGGGGGATA CAGGTTAGTA TGGTGACCGA TGTATTCTGC CCAACCGGTA CCAAAGAAGT 
CGTAGGTCAT CACAAAGATA TTGTCTAAAT AAGGTGCGAT TTCTTTGAAG CTGGACTTCT 
CCATTTTGGC AACGACGGCG CTACAGGCTA TCGTGATTTC TTTACGGGCC CGGGTTCCAA 
AGGCGATGTT CAGTGCTTCA CGCAGCTCTT TCACTAACAA AACATAGTTT GGGCCATCAT 
GTTCCGGGTC GAATTCATTA CCTTCTTCAC CTGTGGCGCC GGGGTATTCC CAGTCGATAT 
CCACCGCAGT AAACATGGGA AAACGCCGGG AAGAAGTCGA CGATGCTACT CACAAATGTA 
GCACGTTGCT CAGGATCTTT GGCCATCACA GAGAAATACC CTGACATACT CCAGCCGCCG 
ATACTGAATG CGAGTTCCAG CTTATGCCCT GCCTGTTTTG CTCGCGCTTT CAGATTACGC 
AATCCCCCCA GTAAACCGGA GG CTG CATCC TGATTGTAAT ATTGCAAGAA ATTCTTCGGG 
CTGG CAT CAC GG CG CTG AT C CGCGTCCAGA CCGACA1TGC GTGTGGTGCC TAAATCACCA 
TAAGGATCAA CGGGTACAAT ATGGCCTAAT GTAATAGGGG CAATCTGGCC ACTGCTGGCT 
TCTGCTTGCC GGTTCCACCC GTCAACAACC TCATTAATCC GTTCGGATAA CTTGCCTTTG 
TCACCGTTGA CGG CCATAAA ACTGAAAATC AGGCGGTCGT AGGCGGTAGG CGGGATTTTT 
TCCAGATCAA AACCACGGCC GGGGGCATCG TCGCTGGTCA GCGCAGTGTT ATCCTGGGTT 
TCTGGCGACA AACGCGCATC ATACTGGCAC CAGTCAGTAA TATAGGCAGA GACTTTAGGC 
AGCGGTTCTG TATTTTCCGG ATCAA CTTCA TATT CG TTG T A CAGGG A CTT GG CAACACGT 
GCTGAAGAAT AACTCAAAGG . AGTTCCGCTG CCGTCAGGTT TATATCCCAC CTTCTGATAG 
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gJSSS S?K?3iS Jg?SS" ™ ccc <* cggtacatca 

CAGGCATTCT CATAAlSSS XSSSEE S2S£g? SSSsSS ^ ATTGC 
TCAACCCAGC CGATGTTTTT AAAAACCGfv: rrirrjT»»» ^GTCGGOAAT ATG CCAGCGT 

^e^AT tctSSSJ a^JgSS SSS££ SJSSSSg SS^ca 

GCTTTAATTG AAGGGTATCG ATAAACATTT TTAr nr£™ * i^SIrEZE 00 GTCAGACATC 
TATTCCGGGG CCGGCTCCTG ATATC^S SSS^ GGCCCCGTTA 
AG A CGGCTAC GAACCTGCTG GCTGAACTCA TTACTTcS? rl^^T^ ATGTTTATTC 
TAACGCAGAT GGAGGATAAT ATCGCTCAGC GACrSfrrf S^I^^IH ACGCGCGGTA 
AATTCCAACT TcdSSS SSSgJ 25S52? ^^ CG 

TCAAAATTGA GCCTGAACAT GCCGCTGTCT TCraTnr^rP T^f^SS** ^^TCATCA 
TTAGCCTGTA CGTTCAGCAA AACGTTTTCG SSSSS? T£^™ C CACACCTTCA 
TAATCGATAG TTTTTAAGTC A G Sg^TC SUSS? JSSS?^ KSiSSS**- 
GCCCGTACAT CTTCATAAGG CCCCAGCAAT GmfirraaT? T^nAS^I? 3 TAC CAGTGAA 

cgccgatcag cgtgggtcgg 3253i?? cScSSg ?S?HT£ TA 

AATGAACCCG TACTCTTGCC AATTTCCCAC TGTGATGATG TCACTaTS^ Z^S™ 3 ™ 
ATGGTTTTTA TGATCTCCAG ACGTCTGGTC TTATGTTGCA ISaSJSp IZH A £ GG . AT 
TGTAAGGCTA ATTTCAGATG TTCTCCGACC ACrlrrrrrZ r^Z^S?l G ATCCATCCGT 
CCACTTTGGA CGAAATTCAT SSSSS CcSScS SSJJSS SSSS 6 * 
AGTAAACAGA GGGAATTAAC CGCATCATAG GCTTGCAGGT mIgc^C ?S£EIS5£? C 

sssssi iasisss ssss isssss —s ~ 
ssssss ssses jssgssg S iiE liis 

TAGTGTTGCT GGCGCTCCTG AGCGGCCCGT CGGCTGATGG TCACTCaS? rt^nrT^ 
TGTTGCATGT CAATCGCTTG CTGTTGCAGA TTG CGGG TAA AGCTCTaJS SJSS^ 
TGCTGCATAC GGAAGTGTTC AAAATCGGTA TTGTCTTTTT TCTCCAGCAa ?^^I? GC 
GTGCTGCCGT ACTGAATCAG CGTTTCTGCG GCCTCnTTC r£?££^S 
AAACGATAAT TCGGGATTGC CCGGCGTITC ATCCcScS X?25S£S 
TGGTAACGCT GCCTGAGCAG ATCTTGCGGG CTG ATGGGTT CATCgSJXJ 
3 6 061 AACTCTTTAC CATCCAAGGT CAGGTTATGA CGTAAGTTAT ATAcScgSS T££™*™ 

36121 tgccacagtt tgagatattc cgtatcaaca ggtttgacaa a?aS??aS SK^SJT 
36181 gagacggatg tatcatatgt cacaggcaga agtggcacgt tcctgISS SrrS^ 

TCCTGTGCCC GTGCTTCACT GTTTTCATAC AGAGCCACAT CTtcSg^? ^r3J^n 
CAGTTTGCCG CGAGCAGAAT ATCAGGGCTG GTACCCAGTA ACATATTGA? 
ATCTGCTTGG CGACAGTACG TGCACTGGAT GTCAGCTTAC GGTAT-Ar^ SS^ TCATAG 
TCTAACAGAT TCTTGACATA GAAACGGAAT ATTGCTTTC C gSa^^t SIH£ CCTGA 
SS^JS 0 ^^TC GGTTGGTTCA aS£SEtc G^SSSSS? gSSS? 
1^™™ GCCGTGA ATT CCAGTAACGC GGTTTACCTT GGTTGCTGGC raSXSS 
TCATCTTCCA GCGGATTAAA AATATAGTG C AGCCATTCGG TGGCCTCT?t SI^^H 
TCTATATTCA GTCGCCACGC GACCAGAAAT GGCATATGGA AaSSSS? ^^a^J 
ATCCCATTTG CGCCATTTAA ATCAATCGGC GTAGGGAATG AACaSK-25 ^?^E AG 
f TAATAAGCT GTGTATTC ^ GCTCAGTACC TCcSsKJl? SJS£S£? £1S£S£? 
AGTTTTTTTG CAAACAGTGT ATTAAGGCGA ATGTTTTGTG GCGCGTTVT^ 7^2,^ C 

gcggggaagg aaaggaattg cacctgatcc tgttSttga g?S£3SE 

TGCATACCGA TTCTGAACTC TTGAGTACAG CTGGCACTTT CATTGcSS? Urar^^ 
GG CTTAAAGA GAAGTTCGGC TTTCAGGGTG ATTCGATTAT CTOArrrr^ A ££ A ? CriTG 
GGATAGGTTA AATCAAGAAC TTTTTCGC'^'C AGTACCAGTG S5^°*G S TTGATTGAT 
TTATCGTGCA TCAGCCGGAA aSUcG^G JSSStS? GATCTTCTAT 
TTAAAGTCAG ATTGAGCGAC AATCTCCAGT GTGTCATCAG TGCCAtSa? SSSS^ 

^™ tactgtctit gccgaaatca gggttcattc Swires!? toSSX 

TAGGAAAGCG TTCTTCCCGG GTTGCCGGAT AGAGCACCAT ACTArr£-?a* Izt C S GGCAA 
TGCCTTAAGG CATCCTTGTG TTCACGTGAG TAATACCAGA Sg^S 
CCTTTTCGTC CATCAGCATA TTGGTCATCC GGCAAATCAG TAATTTCT?? r^I™ 
TCGCAGACAT AACCGAAGGC TTCGTCATAA TCATAATCCT TAcSJS? SS£S2£ A 
TGAAGACGGA CAAACGGAAC CAGAGCCAGA AACGGGTTAT GCGgSSJ rl^tl^ 
ATCACAGCAA CCATCTGGGC CATCCGGTAT TGCAGATGTC TTCGcArl^ F^I^ 700 

tactccagct gccatcatat ttcgcataag cgattttga? SgS?cS^ A A r^^Z G 

AGGAACCCAA TCACCCGCAC TAGGCTCAAC GTTTTGGTTA TGCAcS^ AGGGTGTGG G 
ATCTTTAGTT TCAGA CTGTT CTTCAACTTC CGTCCaSS It^SS^C r^™ 1 
GAAAATGGGG CGTATCAAAT TGGGGTCTAC GCTGCCCAAT GGCAOr^^n S A J^ 
CTCGCTCCAG GCATTGGGAG ATAACGCATC (SGTATCAgJa ?SctI? AA 1^™" 
TGAACGCCAG TAATATTGGT ATGGCTGTGT ACGGGTACGT cr^arl^ 0 AAAGATTCAG 

gcgtttgatg ttaacaccat cttcataacc ^Sfe? 55S^ A JSSggJ? 
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3 8101 AAAATTATTC AGATAACCGA GCACCGCTTG TTGTACAGAA TCTTCGGTAA TTTTTCCCTG 
3 8161 ATTAAGGGCA CTTTCCAGTT GGAAGAAGAA TTCTGTTTTA TTCAGG CGTA ACAGGGGTTC 
3 8221 CAGATAGCTT TCCGGATAAG TCCGTAATAA GCGATCCC 



N=unspecified base 
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